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NOVEL MOLECULES OF THE TANGO- 77 RELATED PROTEIN 

FAMILY AND USES THEREOF 

Background of the Invention 
The polypeptide cytokine interleukin-1 (IL-1) 
5 is a critical mediator of inflammatory and overall immune 
response. To date, three members of the IL-1 family, 
IL-lce, IL-1/? and IL-lra (Interleukin-1 receptor 
antagonist) have been isolated and cloned. IL-lc* and 
IL-1/3 are proinflammatory cytokines which elicit 

10 biological responses, whereas IL-lra is an antagonist of 
IL-la and IL-ljS activity. Two distinct cell-surface 
receptors have been identified for these ligands, the 
type 1 IL-1 receptor (IL-lRtI) and type II IL-1 receptor 
(IL-lRtll) . Recent results suggest that the IL-lRtI is 

15 the receptor responsible for transducing a signal and 
producing biological effects. 

As mentioned above, IL-1 is a key mediator of the 
host inflammatory response. While inflammation is an 
important homeostatic mechanism, aberrant inflammation 

20 has the potential for inducing damage to the host. 

Elevated IL-1 levels are known to be associated with a 
number of diseases particularly autoimmune diseases and 
inflammatory disorders. 

Since Il-lra is a naturally occurring inhibitor of 

25 IL-1, IL-lra can be used to limit the aberrant and 

potentially deleterious effects of IL-1. In experimental 
animals, pretreatment with IL-lra has been shown to 
prevent death resulting from lipopolysaccharide- induced 
sepsis. The relative absence of IL-lra has also been 

30 suggested to play a role in human inflammatory bowel 
disease . 



Summary of the Invention 
The present invention is based, at least in part, 
on the discovery of a gene encoding Tango-77, a secreted 
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protein that is predicted to be a member of the cytokine 
superfamily. The Tango-77 cDNA described below (SEQ ID 
NO:l) has three possible open reading frames. The first 
potential open reading frame encompasses 534 nucleotides 
5 extending from nucleotide 356 to nucleotide 889 of SEQ ID 
NO:l (SEQ ID NO: 3) and encodes a 178 amino acid protein 
(SEQ ID NO: 2) . This protein may include a predicted 
signal sequence of about 63 amino acids (from about amino 
acid 1 to about amino acid 63 of SEQ ID NO: 2 (SEQ ID 
10 NO: 4) and a predicted mature protein of about 115 amino 
acids (from about amino acid 64 to amino acid 178 of SEQ 
ID NO: 2 (SEQ ID NO: 5) ) . 

The second potential open reading frame 
encompasses 498 nucleotides extending from nucleotide 389 

15 to nucleotide 889 of SEQ ID NO:l (SEQ ID NO: 6) and 
encodes a 167 amino acid protein (SEQ ID NO: 7) . This 
protein may include a predicted signal sequence of about 
52 amino acids (from about amino acid 1 to about amino 
acid 52 of SEQ ID NO:7 (SEQ ID NO:8)) and a predicted 

20 mature protein of about 115 amino acids (from about amino 
acid 52 to amino acid 167 of SEQ ID NO: 7 (SEQ ID NO: 9)) . 

The third potential open reading frame encompasses 
408 nucleotides extending from nucleotide 481 to 
nucleotide 889 of SEQ ID NO:l (SEQ ID NO: 10) and encodes 

25 a 136 amino acid protein (SEQ ID NO: 11) . This protein 
includes a predicted signal sequence of about 21 amino 
acids (from about amino acid 1 to about amino acid 21 of 
SEQ ID NO: 11 (SEQ ID NO: 12)) and a predicted mature 
protein of about 115 amino acids (from about amino acid 

30 22 to amino acid 136 of SEQ ID N0:11 (SEQ ID NO:13)). 

As used herein, the terms "Tango-77", "Tango-77 
protein", "Tango-77 polypeptide" amd the like, can refer 
and polypeptide produced by the cDNA of SEQ ID NO:l 
including any and all of the Tango-77 gene products 

35 described above. 
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Tango-77 is expected to inhibit inflammation and 
play a functional role similar to that of secreted 
IL-lra. For example, it is expected that Tango-77 may 
bind to the IL-1 receptor, thus blocking receptor 
5 activation by inhibiting the binding of IL-la and IL-1/3 
to the receptor. Alternatively, Tango-77 may inhibit 
inflammation through another pathway, for example, by 
binding to a novel receptor. Accordingly, Tango-77 may 
be useful as a modulating agent in regulating a variety 
10 of cellular processes including acute and chronic 

inflammation, e.g., asthma, chronic myelogenous leukemia, 
rheumatoid arthritis, psoriasis and inflammatory bowel 
disease . 

In one aspect, the invention provides isolated 

15 nucleic acid molecules encoding Tango-77 or biologically 
active portions thereof, as well as nucleic acid 
fragments suitable as primers or hybridization probes for 
the detection of Tango-77. 

The invention encompasses methods of diagnosing 

20 and treating patients who are suffering from a disorder 
associated with an abnormal level (undesirably high or 
undesirably low) of inflammation, abnormal activity of 
the IL-1 receptor complex, or abnormal activity of IL-1, 
by administering a compound that modulates the expression 

25 of Tango-77 (at the DNA, mRNA or protein level, e.g., by 
altering mRNA splicing) or by altering the activity of 
Tango-77. Examples of such compounds include small 
molecules, antisense nucleic acid molecules, ribozymes, 
and polypeptides. 

30 The invention features a nucleic acid molecule 

which is at least 45% (e.g., 55%, 65%, 75%, 85%, 95%, or 
98%) identical to the nucleotide sequence shown in SEQ ID 
NO:l, SEQ ID N0:3, SEQ ID N0:6, SEQ ID NO:10, the 
nucleotide sequence of the cDNA insert of the plasmid 
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deposited with ATCC as Accession Number (the "cDNA of 
ATCC 98807") , or a complement thereof . 

The invention features a nucleic acid molecule 
which includes a fragment of at least 100 (e.g., 250, 
5 325, 350, 375, 400, 425, 450, 500, 550, 600, 650, 700, 
800, 900, or 989) nucleotides of the nucleotide sequence 
shown in SEQ ID NO:l, SEQ ID NO: 3, SEQ ID NO: 6, SEQ ID 
NO: 10, the nucleotide sequence of the cDNA ATCC 98807, or 
a complement thereof. 

io The invention also features a nucleic acid 

molecule which includes a nucleotide sequence encoding a 
protein having an amino acid sequence that is at least 
45% (55%, 65%, 75%, 85%, 95%, or 98%) identical to the 
amino acid sequence of SEQ ID NO: 2, SEQ ID NO: 5, SEQ ID 

15 NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, or the 
amino acid sequence encoded by the cDNA of ATCC 98807. 

In a preferred embodiment, a Tango- 77 nucleic acid 
molecule has the nucleotide sequence shown in SEQ ID 
NO:l, SEQ ID NO:3, SEQ ID NO:6, SEQ ID NO:10 or the 

20 nucleotide sequence of the cDNA of ATCC 98807. 

Also within the invention is a nucleic acid 
molecule which encodes a fragment of a polypeptide having 
the amino acid sequence of SEQ ID NO: 2, SEQ ID NO: 4, SEQ 
ID NO:5, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID 

25 NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, wherein the fragment 
includes at least 15 (e.g., 25, 30, 50, 100, 150, or 178) 
contiguous amino acids of SEQ ID NO: 2, SEQ ID NO: 4, SEQ 
ID NO:5, SEQ ID NO:7, SEQ ID N0:8, SEQ ID NO:9, SEQ ID 
NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, or the polypeptide 

30 encoded by the cDNA of ATCC Accession Number 98807. 

The invention includes a nucleic acid molecule 
which encodes a naturally occurring allelic variant of a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO:2, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:8, 

35 SEQ ID NO:9, SEQ ID NO:ll, SEQ ID NO:12, SEQ ID NO:13, or 
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an amino acid sequence encoded by the cDNA of ATCC 
Accession Number 98807, wherein the nucleic acid molecule 
hybridizes to a nucleic acid molecule comprising SEQ ID 
NO:l, SEQ ID NO: 3, SEQ ID NO: 6, SEQ ID NO: 10, or a 
5 complement thereof under stringent conditions. 

Also within the invention are: an isolated 
Tango- 77 protein having an amino acid sequence that is at 
least about 45%, preferably 65%, 75%, 85%, 95%, or 98% 
identical to the amino acid sequence of SEQ ID NO: 5, SEQ 

10 ID NO: 9 or SEQ ID NO:13 (mature human Tango- 77 ) , or the 
amino acid sequence of SEQ ID NO:2, SEQ ID NO: 7 or SEQ ID 
NO: 11 (immature human Tango- 77) . 

Also within the invention are: an isolated 
Tango- 77 protein which is encoded by a nucleic acid 

is molecule having a nucleotide sequence that is at least 
about 65%, preferably 75%, 85%, or 95% identical to SEQ 
ID NO: 3, SEQ ID NO: 6, SEQ ID NO: 10 or the cDNA of ATCC 
98807; and an isolated Tango- 77 protein which is encoded 
by a nucleic acid molecule having a nucleotide sequence 

20 which hybridizes under stringent hybridization conditions 
to a nucleic acid molecule having the nucleotide sequence 
of SEQ ID NO: 3, SEQ ID NO : 6 , SEQ ID NO: 10, the non-coding 
strand of the cDNA of ATCC 98807, or the complement 
thereof . 

25 Also within the invention is a polypeptide which 

is a naturally occurring allelic variant of a polypeptide 
that includes the amino acid sequence of SEQ ID NO:2, SEQ 
ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID 
NO: 9, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, or an 

30 amino acid sequence encoded by the cDNA insert of the 
plasmid deposited with ATCC as Accession Number 98807, 
wherein the polypeptide is encoded by a nucleic acid 
molecule which hybridizes to a nucleic acid molecule 
comprising SEQ ID NO:l, SEQ ID NO: 3, SEQ ID NO: 6, SEQ ID 
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NO: 10 or the complement thereof under stringent 
conditions . 

Another embodiment of the invention features 
Tango-77 nucleic acid molecules which specifically detect 
5 Tango-77 nucleic acid molecules relative to nucleic acid 
molecules encoding other members of the cytokine 
superfamily. For example, in one embodiment, a Tango-77 
nucleic acid molecule hybridizes under stringent 
conditions to a nucleic acid molecule comprising the 

10 nucleotide sequence of SEQ ID N0:1, SEQ ID NO: 3, SEQ ID 
NO: 6, SEQ ID NO: 10, the cDNA of ATCC 98807, or a 
complement thereof. In another embodiment, the Tango-77 
nucleic acid molecule is at least 300 (325, 350, 375, 
400, 425, 450, 500, 550, 600, 650, 700, 800, 900, or 989) 

is nucleotides in length and hybridizes under stringent 
conditions to a nucleic acid molecule comprising the 
nucleotide sequence shown in SEQ ID NO:l, SEQ ID NO: 3, 
SEQ ID NO: 6, SEQ ID NO: 10, the cDNA of ATCC 98807, or a 
complement thereof- In yet another embodiment, the 

20 invention provides an isolated nucleic acid molecule 
which is ant i sense to the coding strand of a Tango-77 
nucleic acid. 

Another aspect of the invention provides a vector, 
e.g., a recombinant expression vector, comprising a 

25 Tango-77 nucleic acid molecule of the invention. In 
another embodiment, the invention provides a host cell 
containing such a vector. The invention also provides a 
method for producing Tango-77 protein by culturing, in a 
suitable medium, a host cell of the invention containing 

30 a recombinant expression vector such that a Tango-77 
protein is produced. 

Another aspect of this invention features isolated 
or recombinant Tango-77 proteins and polypeptides. 
Preferred Tango-77 proteins and polypeptides possess at 

35 least one biological activity possessed by naturally 
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occurring human Tango-77, e.g., (i) the ability to 
interac'tT with proteins in the Tango-77 signalling pathway 
(ii) the ability to interact with a Tango-77 ligand or 
receptor; or (iii) the ability to interact with an 
5 intracellular target protein, (iv) the ability to 

interact with a protein involved in inflammation and (v) 
the ability to bind the IL-1 receptor. Other activities 
include the induction and suppression of polypeptide 
interleukins, cytokines and growth factors. 

10 The Tango-77 proteins of the present invention, or 

biologically active portions thereof, can be operably 
linked to a non-Tango-77 polypeptide (e.g., heterologous 
amino acid sequences) to form Tango-77 fusion proteins. 
The invention further features antibodies that 

is specifically bind Tango-77 proteins, such as monoclonal 
or polyclonal antibodies. In addition, the Tango-77 
proteins or biologically active portions thereof can be 
incorporated into pharmaceutical compositions, which 
optionally include pharmaceutically acceptable carriers. 

20 In another aspect, the present invention provides 

a method for detecting the presence of Tango-77 activity 
or expression in a biological sample by contacting the 
biological sample with an agent capable of detecting an 
indicator of Tango-77 activity or expression such that 

25 the presence of Tango-77 activity or expression is 
detected in the biological sample. 

In another aspect, the invention provides a method 
for modulating Tango-77 activity comprising contacting a 
cell with an agent that modulates (inhibits or 

30 stimulates) 

Tango-77 activity or expression such that Tango-77 
activity or expression in the cell is modulated. In one. 
embodiment, the agent is an antibody that specifically 
binds to Tango-77 protein. In another embodiment, the 
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agent modulates expression of Tango- 77 by modulating 
transcription of a Tango-77 gene, splicing of a Tango-77 
mRNA, or translation of a Tango-77 mRNA. In yet another 
embodiment, the agent is a nucleic acid molecule having a 
5 nucleotide sequence that is antisense to the coding 
strand of the Tango-77 mRNA or the Tango-77 gene. 

In one embodiment, the methods of the present 
invention are used to treat a subject having a disorder 
characterized by aberrant Tango-77 protein activity or 

10 nucleic acid expression by administering an agent which 
is a Tango-77 modulator to the subject. In one 
embodiment, the Tango-77 modulator is a Tango-77 protein. 
In another embodiment, the Tango-77 modulator is a 
Tango-77 nucleic acid molecule. In other embodiments, 

is the Tango-77 modulator is a peptide, peptidomimetic, or 
other small molecule. In a preferred embodiment, the 
disorder characterized by aberrant Tango-77 protein or 
nucleic acid expression can include chronic and acute 
inflammation. 

20 The present invention also provides a diagnostic 

assay for identifying the presence or absence of a 
genetic lesion or mutation characterized by at least one 
of: (i) aberrant modification or mutation of a gene 
encoding a Tango-77 protein; (ii) mis -regulation of a 

25 gene encoding a Tango-77 protein; and (iii) aberrant 
post-translational modification of a Tango-77 protein, 
wherein a wild- type form of the gene encodes a protein 
with a Tango-77 activity. 

In another aspect, the invention provides a 

30 method for identifying a compound that binds to or 
modulates the activity of a Tango-77 protein. In 
general, such methods entail measuring a biological 
activity of a Tango-77 protein in the presence and 
absence of a test compound and identifying those 



WO 99/06426 



PCT/US98/16102 



- 9 - 

compounds which alter the activity of the Tango-77 
protein. 

The invention also features methods for 
identifying a compound which modulates the expression of 
5 Tango-77 by measuring the expression of Tango-77 in the 
presence and absence of a compound . 

Other features and advantages of the invention 
will be apparent from the following detailed description 
and claims. 

10 Brief Description of the Drawings 

Figure 1 depicts the cDNA sequence (SEQ ID NO:l) 
of Tango-77. The Tango-77 cDNA has three possible open 
reading frames which encode the amino acid sequence (SEQ 
ID NO:2, SEQ ID NO:7 and SEQ ID NO: 11) of human Tango-77. 

is The three potential open reading frames of SEQ ID NO:l 
extend from: (1) nucleotide 356 to nucleotide 889 (SEQ ID 
NO:3); (2) nucleotide 389 to nucleotide 889 (SEQ ID 
NO:6); and (3) nucleotide 481 to nucleotide 889 (SEQ ID 
NO: 10) . 

20 Figure 2 depicts an alignment of an amino acid 

sequence of Tango-77 (T77; SEQ ID NO:2) with IL-1RA (SEQ 
ID NO:14), and IL-l/J (SEQ ID NO: 15). 

Figure 3 depicts the genomic sequence of BAC1 (SEQ 
ID NO: 16) . 

25 Figure 4 depicts the genomic sequence of BAC2 (SEQ 

ID NO:17) . 

Figure 5 depicts an amino acid sequence of an 
alternatively spliced form of Tango-77 (SEQ ID NO:2) as 
predicted by Procrustes (T77-procrustes; SEQ ID NO: 18) . 
30 . Figure 6 depicts an alignment of an amino acid 

sequence of an alternatively spliced form of Tango-77 
(T77-procrustes; SEQ ID NO: 18) with Tango-77 (SEQ ID 
NO:2) . 
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Figure 7 depicts an alignment of an amino acid 
sequence of an alternatively spliced form of Tango- 77 
(T77-procrustes; SEQ ID NO: 18) with IL-lra (SEQ ID 
NO:14), and IL-ljS (SEQ ID NO:15). 

5 Detailed Description of the Invention 

The present invention is based on the discovery of 
a cDNA molecule encoding human Tango- 77, a member of the 
cytokine superfamily. The cDNA molecule encoding human 
Tango- 77 has three possible open reading frames. The 

10 three possible nucleotide open reading frames for human 
Tango-77 protein are shown in Figure 1 (SEQ ID NO: 3, SEQ 
ID NO: 6 and SEQ ID NO: 10). The predicted amino acid 
sequence for the three possible Tango-77 immature 
proteins are also shown in 

15 Figure 1 (SEQ ID NO:2, SEQ ID NO: 7 or SEQ ID NO: 11) and 
three possible mature proteins are also shown in Figure 1 
(SEQ ID NO: 5, SEQ ID NO: 9 and SEQ ID NO: 13) . 

The Tango-77 cDNA of Figure 1 (SEQ ID NO:l) , which 
is approximately 989 nucleotides long including 

20 untranslated regions, encodes a protein amino acid having 
a molecular weight of approximately 19 kDa, 18 kDa, or 
14.9 KDa (excluding post-translational modifications) and 
the possible mature form of the protein has a molecular 
weight of 13 kDa. A plasmid containing a cDNA encoding 

25 human Tango-77 (with the cDNA insert name of Of fthx077) 
was deposited with American Type Culture Collection 
(ATCC) , 10801 University Boulevard, Manassas, Virginia 
20110-2209 on July 2, 1998 and assigned Accession Number 
98807. This deposit will be maintained under the terms 

30 of the Budapest Treaty on the International Recognition 
of the Deposit of Microorganisms for the Purposes of 
Patent Procedure. This deposit was made merely as a 
convenience for those of skill in the art and is not an 
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admission that a deposit is required under 3 5 U.S.C. 
§112 . 

Human Tango-77 is one member of a family of 
molecules (the "Tango-77 family") having certain 
5 conserved structural and functional features. The term 
"family, " when referring to the protein and nucleic acid 
molecules of the invention, is intended to mean two or 
more proteins or nucleic acid molecules having a common 
structural domain and having sufficient amino acid or 

10 nucleotide sequence identity as defined herein. Such 

family members can be naturally occurring and can be from 
either the same or different species. For example, a 
family can contain a first protein of human origin and a 
homologue of that protein of murine origin, as well as a 

is second, distinct protein of human origin and a murine 
homologue of that protein. Members of a family may also 
have common functional characteristics. 

As used interchangeably herein a "Tango-77 
activity", "biological activity of Tango-77" or 

20 "functional activity of Tango-77", refers to an activity 
exerted by a Tango-77 protein, polypeptide or nucleic 
acid molecule on a Tango-77 responsive cell as determined 
in vivo, or in vitro, according to standard techniques. 
A Tango-77 activity can be a direct activity, such as an 

25 association with a second protein, or an indirect 

activity, such as a cellular signaling activity mediated 
by interaction of the Tango-77 protein with a second 
protein. In a preferred embodiment, a Tango-77 activity 
includes at least one or more of the following 

30 activities: (i) the ability to interact with proteins in 
the Tango-77 signalling pathway (ii) the ability to 
interact with a Tango-77 ligand or receptor; or (iii) the 
ability to interact with an intracellular target protein, 
(iv) the ability to interact with a protein involved in 
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inflammation, and (v) the ability to bind the IL-1 
receptor. 

Accordingly, another embodiment of the invention 
features isolated Tango- 77 proteins and polypeptides 
5 having a Tango- 77 activity. 

Yet another embodiment of the invention features 
Tango-77 molecules which contain a signal sequence. 
Generally, a signal sequence (or signal peptide) is a 
peptide containing about 21 to 63 amino acids which 

10 occurs at the extreme N-terminal end of a secretory 
protein. The native Tango-77 signal sequence (SEQ ID 
NO: 4, SEQ ID NO: 8, or SEQ ID NO: 12) can be removed and 
replaced with a signal sequence from another protein. In 
certain host cells (e.g., mammalian host cells), 

is expression and/or secretion of Tango-77 can be increased 
through use of a heterologous signal sequence. For 
example, the gp67 secretory sequence of the baculovirus 
envelope protein can be used as a heterologous signal 
sequence. Alternatively, the native Tango-77 signal 

20 sequence can itself be used as a heterologous signal 
sequence in expression systems, e.g., to facilitate the 
secretion of a protein of interest. 

Various aspects of the invention are described in 
further detail in the following subsections. 

25 I. Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated 
nucleic acid molecules that encode Tango-77 proteins or 
biologically active portions thereof, as well as nucleic 
acid molecules sufficient for use as hybridization probes 

30 to identify Tango- 77 -encoding nucleic acids (e.g., 

Tango-77 mRNA) and fragments for use as PGR primers for 
the amplification or mutation of Tango-77 nucleic acid 
molecules. As used herein, the term "nucleic acid 
molecule" is intended to include DNA molecules (e.g., 
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cDNA or genomic DNA) and RNA molecules (e.g., mRNA) and 
analogs of the DNA or RNA generated using nucleotide 
analogs. The nucleic acid molecule can be single- 
stranded or double -stranded, but preferably is double- 
5 stranded DNA. 

An "isolated" nucleic acid molecule is one which 
is separated from other nucleic acid molecules which are 
present in the natural source of the nucleic acid. 
Preferably, an "isolated" nucleic acid is free of 

10 sequences (preferably protein encoding sequences) which 
naturally flank the nucleic acid (i.e., sequences located 
at the 5' and 3' ends of the nucleic acid) in the genomic 
DNA of the organism from which the nucleic acid is 
derived. For example, in various embodiments, the 

is isolated Tango-77 nucleic acid molecule can contain less 
than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb 
of nucleotide sequences which naturally flank the nucleic 
acid molecule in genomic DNA of the cell from which the 
nucleic acid is derived. Moreover, an "isolated" nucleic 

20 acid molecule, such as a cDNA molecule, can be 

substantially free of other cellular material, or culture 
medium when produced by recombinant techniques, or 
substantially free of chemical precursors or other 
chemicals when chemically synthesized. 

25 A nucleic acid molecule of the present invention, 

e.g., a nucleic acid molecule having the nucleotide 
sequence of SEQ ID N0:1, SEQ ID NO: 3, SEQ ID NO: 6, SEQ ID 
NO: 10, the cDNA of ATCC 98807, or a complement of any of 
these nucleotide sequences, can be isolated using 

30 standard molecular biology techniques and the sequence 
information provided herein. Using all or a portion of 
the nucleic acid sequences of SEQ ID NO:l, SEQ ID NO: 3, 
SEQ ID NO: 6, SEQ ID NO: 10, the cDNA of ATCC 98807, or the 
complement thereof as a hybridization probe, Tango-77 

35 nucleic acid molecules can be isolated using standard 
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hybridization and cloning techniques (e.g., as described 
in Sambrook et al . , eds., Molecular Cloning: A 
Laboratory Manual, 2nd ed., Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold 
5 Spring Harbor, NY, 1989) . 

A nucleic acid of the invention can be amplified 
using cDNA, mRNA or genomic DNA as a template and 
appropriate oligonucleotide primers according to standard 
PCR amplification techniques. The nucleic acid so 

10 amplified can be cloned into an appropriate vector and 
characterized by DNA sequence analysis. Furthermore, 
oligonucleotides corresponding to Tango- 77 nucleotide 
sequences can be prepared by standard synthetic 
techniques, e.g., using an automated DNA synthesizer. 

15 In another preferred embodiment, an isolated 

nucleic acid molecule of the invention comprises a 
nucleic acid molecule which is a complement of the 
nucleotide sequence shown in SEQ ID N0:1, SEQ ID NO: 3, 
SEQ ID NO: 6, SEQ ID NO: 10 the cDNA of ATCC 98807, or a 

20 portion thereof. A nucleic acid molecule which is 

complementary to a given nucleotide sequence is one which 
is sufficiently complementary to the given nucleotide 
sequence that it can hybridize to the given nucleotide 
sequence thereby forming a stable duplex. 

25 Moreover, the nucleic acid molecule of the 

invention can comprise only a portion of a nucleic acid 
sequence encoding Tango- 77, for example, a fragment which 
can be used as a probe or primer or a fragment encoding a 
biologically active portion of Tango-77. The nucleotide 

30 sequence determined from the cloning of the human 

Tango-77 gene allows for the generation of probes and 
primers designed for use in identifying and/or cloning 
Tango-77 homologues in other cell types, e.g., from other 
tissues, as well as Tango-77 homologues from other 

35 mammals. The probe/primer typically comprises 



WO 99/06426 PO7US98/16102 

- 15 - 

substantially purified oligonucleotide. The 
oligonucleotide typically comprises a region of 
nucleotide sequence that hybridizes under stringent 
conditions to at least about 12, preferably about 25, 
5 more preferably about 50, 75, 100, 125, 150, 175, 200, 
250, 300, 350 or 400 consecutive nucleotides of the sense 
or ant i -sense sequence of SEQ ID N0:1, SEQ ID NO: 3, SEQ 
ID NO: 6, SEQ ID NO: 10, or the cDNA of ATCC 98807. 
Alternatively, the oligonucleotide can typically comprise 

10 a region of nucleotide sequence that hybridizes under 
stringent conditions to at least about 12, preferably 
about 25, more preferably about 50, 75, 100, 125, 150, 
175, 200, 250, 300, 350 or 400 consecutive nucleotides of 
the sense or ant i- sense sequence of a naturally occurring 

15 mutant of SEQ ID N0:1, SEQ ID NO: 3, SEQ ID NO: 6, SEQ ID 
NO: 10, or the cDNA of ATCC 98807. 

Probes based on the human Tango-77 nucleotide 
sequence can be used to detect transcripts or genomic 
sequences encoding the same or identical proteins. The 

20 probe comprises a label group attached thereto, e.g., a 
radioisotope, a fluorescent compound, an enzyme, or an 
enzyme co-factor. Such probes can be used as a part of a 
diagnostic test kit for identifying cells or tissues 
which mis-express a Tango-77 protein, such as by 

25 measuring a level of a Tango- 77 -encoding nucleic acid in 
a sample of cells from a subject, e.g., detecting 
Tango-77 mRNA levels or determining whether a genomic 
Tango-77 gene has been mutated or deleted. 

A nucleic acid fragment encoding a "biologically 

30 active portion of Tango-77" can be prepared by isolating 
a portion of SEQ ID NO:l, SEQ ID NO: 3, SEQ ID NO: 6, SEQ 
ID NO: 10 or the nucleotide sequence of the cDNA of ATCC 
98807 which encodes a polypeptide having a Tango-77 
biological activity, expressing the encoded portion of 

35 Tango-77 protein (e.g., by recombinant expression in 
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vitro) and assessing the activity of the encoded portion 
of Tango- 77. 

The invention further encompasses nucleic acid 
molecules that differ from the nucleotide sequence of SEQ 
5 ID NO : 1 , SEQ ID NO:3, SEQ ID NO: 6, SEQ ID NO: 10, or the 
cDNA of ATCC 98807 due to degeneracy of the genetic code 
and thus encode the same Tango-77 protein as that encoded 
by the nucleotide sequence shown in SEQ ID NO:l, SEQ ID 
NO: 3, SEQ ID NO: 6, SEQ ID NO: 10, or the cDNA of ATCC 
10 98807. 

In addition to the human Tango-77 nucleotide 
sequence shown in SEQ ID NO:l, SEQ ID NO: 3, SEQ ID NO: 6, 
SEQ ID NO: 10, or the cDNA of ATCC 98807, it will be 
appreciated by those skilled in the art that DNA sequence 

is polymorphisms that lead to changes in the amino acid 
sequences of Tango-77 may exist within a population 
(e.g., the human population). Such genetic polymorphism 
in the Tango-77 gene may exist among individuals within a 
population due to natural allelic variation. An allele 

20 is one of a group of genes which occur alternatively at a 
given genetic locus. As used herein, the term "allelic 
variant" refers to a nucleotide sequence which occurs at 
a Tango-77 locus or to a polypeptide encoded by the 
nucleotide sequence. As used herein, the terms "gene" 

25 and "recombinant gene" refer to nucleic acid molecules 
comprising an open reading frame encoding a Tango-77 
protein, preferably a mammalian Tango-77 protein. Such 
natural allelic variations can typically result in 1-5% 
variance in the nucleotide sequence of the Tango-77 gene. 

30 Alternative alleles can be identified by sequencing the 
gene of interest in a number of different individuals. 
This can be readily carried out by using hybridization 
probes to identify the same genetic locus in a variety of 
individuals. Any and all such nucleotide variations and 

35 resulting amino acid polymorphisms or variations in 
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Tango- 7 7 that are the result of natural allelic variation 
and that do not alter the functional activity of Tango-77 
are intended to be within the scope of the invention. 

Moreover, nucleic acid molecules encoding Tango- 77 
5 proteins from other species (Tango- 77 homologues) , which 
have a nucleotide sequence which differs from that of a 
human Tango- 77, are intended to be within the scope of 
the invention. Nucleic acid molecules corresponding to 
natural allelic variants and homologues of the Tango- 77 

10 cDNA of the invention can be isolated based on their 
identity to the human Tango-77 nucleic acids disclosed 
herein using the human cDNAs, or a portion thereof, as a 
hybridization probe according to standard hybridization 
techniques under stringent hybridization conditions. 

is Accordingly, in another embodiment, an isolated 

nucleic acid molecule of the invention is at least 300 
(325, 350, 375, 400, 425, 450, 500, 550, 600, 650, 700, 
800, or 989) nucleotides in length and hybridizes under 
stringent conditions to the nucleic acid molecule 

20 comprising the nucleotide sequence, preferably the coding 
sequence, Of SEQ ID N0;1, SEQ ID NO:3, SEQ ID NO: 6, SEQ 
ID NO: 10, or the cDNA of ATCC 98807. 

As used herein, the term "hybridizes under 
stringent conditions" is intended to describe conditions 

25 for hybridization and washing under which nucleotide 
sequences at least 60% (65%, 70%, preferably 75%) 
identical to each other typically remain hybridized to 
each other. Such stringent conditions are known to those 
skilled in the art and can be found in Current Protocols 

30 in Molecular Biology, John Wiley & Sons, N.Y. (1989) , 
6.3.1-6.3.6. A preferred, non-limiting example of 
stringent hybridization conditions are hybridization in 
6X sodium chloride/sodium citrate (SSC) at about 45°C, 
followed by one or more washes in 0.2X SSC, 0.1% SDS at 

35 50-65°C. Preferably, an isolated nucleic acid molecule 
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of the invention that hybridizes under stringent 
conditions to the sequence of SEQ ID NO:!, SEQ ID NO: 3, 
SEQ ID NO: 6, SEQ ID NO: 10, the cDNA of ATCC 98807, or the 
complement thereof, corresponds to a naturally-occurring 

5 nucleic acid molecule. As used herein, a "naturally- 
occurring" nucleic acid molecule refers to an RNA or DNA 
molecule having a nucleotide sequence that occurs in 
nature (e.g., encodes a natural protein). 

In addition to naturally-occurring allelic 

10 variants of the Tango- 77 sequence that may exist in the 
population, the skilled artisan will further appreciate 
that changes can be introduced by mutation into the 
nucleotide sequence of SEQ ID NO:l, SEQ ID NO: 3, SEQ ID 
NO: 6, SEQ ID NO: 10 or the cDNA of ATCC 98807, thereby 

15 leading to changes in the amino acid sequence of the 

encoded Tango-77 protein, without altering the biological 
activity of the Tango- 77 protein. Amino acid residues 
that are not conserved or only semiconserved among Tango- 
77 of various species may be non-essential for activity 

20 and thus would likely be targets for alteration. 

Alternatively, one can make nucleotide substitutions 
leading to amino acid substitutions at "non-essential" 
amino acid residues. A "non-essential" amino acid 
residue is a residue that can be altered from the wild- 

25 type sequence of Tango-77 (e.g., the sequence of SEQ ID 
NO:2, SEQ ID N0:5, SEQ ID N0:7, SEQ ID NO:9, SEQ ID NO:ll 
or SEQ ID NO: 13) without altering the biological 
activity, whereas an "essential" amino acid residue is 
required for biological activity. For example, amino 

30 acid residues that are conserved among the Tango-77 

proteins of various species may be essential for activity 
and thus would not likely be targets for alteration, 
unless one wishes to reduce or alter Tango-77 activity. 
Accordingly, another aspect of the invention 

35 pertains to nucleic acid molecules encoding Tango-77 
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proteins that contain changes in amino acid residues that 
are not essential for activity. Such Tango- 77 proteins 
differ in amino acid sequence from SEQ ID NO: 2, SEQ ID 
NO:5, SEQ ID NO: 7, SEQ ID NO:9, SEQ ID NO: 11, or SEQ ID 
5 NO: 13 yet retain biological activity. In one embodiment, 
the isolated nucleic acid molecule includes a nucleotide 
sequence encoding a protein that includes an amino acid 
sequence that is at least about 45% identical, 65%, 75%, 
85%, 95%, or 98% identical to the amino acid sequence of 
10 SEQ ID NO:2, SEQ ID NO;5, SEQ ID NO:7, SEQ ID NO:9, SEQ 
ID NO: 11, or SEQ ID NO: 13. 

An isolated nucleic acid molecule encoding a 
Tango- 77 protein having a sequence which differs from 
that of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:7, SEQ ID 

15 NO: 9, SEQ ID NO: 11, or SEQ ID NO: 13 can be created by 
introducing one or more nucleotide substitutions, 
additions or deletions into the nucleotide sequence of 
SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:6, SEQ ID NO:10, or 
the cDNA of ATCC 98807 such that one or more amino acid 

20 substitutions, additions or deletions are introduced into 
the encoded protein. Mutations can be introduced by 
standard techniques, such as site-directed mutagenesis 
and PCR-mediated mutagenesis. Preferably, conservative 
amino acid substitutions are made at one or more 

25 predicted non-essential amino acid residues. A 

"conservative amino acid substitution" is one in which 
the amino acid residue is replaced with an amino acid 
residue having a similar side chain. Families of amino 
acid residues having similar side chains have been 

30 defined in the art. These families include amino acids 
with basic side chains (e.g., lysine, arginine, 
histidine) , acidic side chains (e.g., aspartic acid, 
glutamic acid), uncharged polar side chains (e.g., 
glycine, asparagine, glut amine, serine, threonine, 

35 tyrosine, cysteine), nonpolar side chains (e.g., alanine, 
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valine, leucine, isoleucine, proline, phenylalanine, 
methionine, tryptophan), beta-branched side chains (e.g., 
threonine, valine, isoleucine) and aromatic side chains 
(e.g., tyrosine, phenylalanine, tryptophan, histidine) . 

5 Thus, a predicted nonessential amino acid residue in 
Tango- 77 is preferably replaced with another amino acid 
residue from the same side chain family. Alternatively, 
mutations can be introduced randomly along all or part of 
a Tango- 77 coding sequence, such as by saturation 

10 mutagenesis, and the resultant mutants can be screened 
for Tango-77 biological activity to identify mutants that 
retain activity. Following mutagenesis, the encoded 
protein can be expressed recombinantly and the activity 
of the protein can be determined. 

is In a preferred embodiment, a mutant Tango-77 

protein can be assayed for: (1) the ability to form 
protein: protein interactions with proteins in the 
Tango-77 signalling pathway; (2) the ability to bind a 
Tango-77 ligand or receptor; or (3) the ability to bind 

20 to an intracellular target protein or (4) the ability to 
interact with a protein involved in inflammation or (5) 
the ability to bind the IL-1 receptor. In yet another 
preferred embodiment, a mutant Tango-77 can be assayed 
for the ability to modulate inflammation, asthma, 

25 autoimmune dieseases, and sepsis. 

The present invention encompasses antisense 
nucleic acid molecules, i.e., molecules which are 
complementary to a sense nucleic acid encoding a protein, 
e.g., complementary to the coding strand of a double- 

30 stranded cDNA molecule or complementary to an mRNA 
sequence. Accordingly, an antisense nucleic acid can 
hydrogen bond to a sense nucleic acid. The antisense 
nucleic acid can be complementary to an entire Tango-77 
coding strand, or to only a portion thereof, e.g., all or 

35 part of the protein coding region (or open reading 
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frame) . An antisense nucleic acid molecule can be 
antisense to a noncoding region of the coding strand of a 
nucleotide sequence encoding Tango-77. The noncoding 
regions ("5' and 3' untranslated regions 1 ') are the 5' and 
5 3' sequences which flank the coding region and are not 
translated into amino acids. 

Given the coding strand sequences encoding 
Tango-77 disclosed herein (e.g., SEQ ID NO: 3, SEQ ID NO: 5, 
or SEQ ID NO: 8), antisense nucleic acids of the invention 

10 can be designed according to the rules of Watson and 

Crick base pairing. The antisense nucleic acid molecule 
can be complementary to the entire coding region of 
Tango-77 mRNA, but more preferably is an oligonucleotide 
which is antisense to only a portion of the coding or 

15 noncoding region of Tango-77 mRNA. For example, the 
antisense oligonucleotide can be complementary to the 
region surrounding the translation start site of Tango-77 
mRNA, e.g., an oligonucleotide having the sequence 
5 ' - TGCAACTTTTACAGGAAACAC - 3' (SEQ ID NO:19) or 

20 5 ' -CCTCACTTTTACCCGAGACTC- 3 ' (SEQ ID NO: 20) or 

5 ' -GACGGGTGGTACTTAAAACAA-3 ' (SEQ ID NO: 21). An antisense 
oligonucleotide can be, for example, about 5, 10, 15, 20, 
25, 30, 35, 40, 45 or 50 nucleotides in length. An 
antisense nucleic acid of the invention can be 

25 constructed using chemical synthesis and enzymatic 
ligation reactions using procedures known in the art. 
For example, an antisense nucleic acid (e.g., an 
antisense oligonucleotide) can be chemically synthesized 
using naturally occurring nucleotides or variously 

30 modified nucleotides designed to increase the biological 
stability of the molecules or to increase the physical 
stability of the duplex formed between the antisense and 
sense nucleic acids, e.g., phosphorothioate derivatives 
and acridine substituted nucleotides can be used. 

35 Examples of modified nucleotides which can be used to 
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generate the antisense nucleic acid include 5- 
f luorouracil , 5-bromouracil , 5-chlorouracil , 
5- iodouracil , hypoxanthine , xanthine , 4 -acetyl cytosine , 
5- (carboxyhydroxylmethyl) uracil, 5- 
5 carboxymethylaminomethyl - 2 - thiouridine , 

5 - c a rboxyme t hy 1 am i nome t hy 1 ur a c i 1 , d i hy dr our a c i 1 , 
beta-D-galactosylqueosine, inosine , N6- 
isopentenyladenine, 1-methylguanine, 1 -methyl inosine , 
2 , 2 -dimethyl guanine, 2 -methyl adenine, 2-methylguanine, 3- 

10 methylcytosine, 5-methylcytosine, N6-adenine, 7- 
methylguanine, 5-methylaminomethyluracil , 5- 
methoxyaminomethyl-2-thiouracil , beta-D-mannosylqueosine, 
5 ' -methoxycarboxymethyluracil , 5-methoxyuracil , 2- 
methylthio-N6-isopentenyladenine, uracil- 5 -oxyacetic acid 

15 (v) , wybutoxosine, pseudouracil, queosine, 2- 

thiocytosine , 5 -methyl -2 - thiouracil , 2 - thiouracil , 4 - 
thiouracil, 5-methyluracil , uracil -5 -oxyacetic acid 
methylester, uracil-5-oxyacetic acid (v) , 5-methyl-2- 
thiouracil, 3- (3-amino-3-N-2-carboxypropyl) uracil 

20 (acp3)w, and 2 , 6-diaminopurine . Alternatively, the 

antisense nucleic acid can be produced biologically using 
an expression vector into which a nucleic acid has been 
subcloned in an antisense orientation (i.e., RNA 
transcribed from the inserted nucleic acid will be of an 

25 antisense orientation to a target nucleic acid of 

interest, described further in the following subsection) . 

The antisense nucleic acid molecules of the 
invention are typically administered to a subject or 
generated in situ such that they hybridize with or bind 

30 to cellular mRNA and/or genomic DNA encoding a Tango-77 
protein to thereby inhibit expression of the protein, 
e.g., by inhibiting transcription and/or translation. 
The hybridization can be by conventional nucleotide 
complementarity to form a stable duplex, or, for example, 

35 in the case of an antisense nucleic acid molecule which 
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binds to DNA duplexes, through specific interactions in 
the major groove of the double helix. An example of a 
route of administration of antisense nucleic acid 
molecules of the invention includes direct injection at a 
5 tissue site. Alternatively, antisense nucleic acid 
molecules can be modified to target selected cells and 
then administered systemically . For example, for 
systemic administration, antisense molecules can be 
modified such that they specifically bind to receptors or 

10 antigens expressed on a selected cell surface, e.g., by 
linking the antisense nucleic acid molecules to peptides 
or antibodies which bind to cell surface receptors or 
antigens. The antisense nucleic acid molecules can also 
be delivered to cells using the vectors described herein. 

15 To achieve sufficient intracellular concentrations of the 
antisense molecules, vector constructs in which the 
antisense nucleic acid molecule is placed under the 
control of a strong pol II or pol III promoter are 
preferred. 

20 An antisense nucleic acid molecule of the 

invention can be an a-anomeric nucleic acid molecule. An 
of-anomeric nucleic acid molecule forms specific double- 
stranded hybrids with complementary RNA in which, 
contrary to the usual jS-units, the strands run parallel 

25 to each other (Gaultier et al. (1987) Nucleic Acids Res. 
15:6625-6641). The antisense nucleic acid molecule can 
also comprise a 2 ' -o-methylribonucleotide (Inoue et al . 
(1987) Nucleic Acids Res. 15:6131-6148) or a chimeric 
RNA- DNA analogue (Inoue et al. (1987) FEBS Lett. 215:327- 

30 330) . 

The invention also encompasses ribozymes . 
Ribozymes are catalytic RNA molecules with ribonuclease 
activity which are capable of cleaving a single-stranded 
nucleic acid, such as an mRNA, to which they have a 
35 complementary region. Thus, ribozymes (e.g., hammerhead 
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ribozymes (described in Haselhoff and Gerlach (1988) 
Mature 334:585-591)) can be used to catalytically cleave 
Tango- 7 7 mRNA transcripts to thereby inhibit translation 
of Tango- 77 mRNA. A ribozyme having specificity for a 
5 Tango- 77 -encoding nucleic acid can be designed based upon 
the nucleotide sequence of a Tango- 77 cDNA disclosed 
herein (e.g., SEQ ID NO:l, SEQ ID N0:3, SEQ ID HO:6 t SEQ 
ID NO: 10) . For example, a derivative of a Tetrahymena L- 
19 IVS RNA can be constructed in which the nucleotide 

10 sequence of the active site is complementary to the 

nucleotide sequence to be cleaved in a Tango- 7 7 -encoding 
mRNA. See, e.g., Cech et al . U.S. Patent No. 4,987,071; 
and Cech et al . U.S. Patent No. 5,116,742. 
Alternatively, Tango- 77 mRNA can be used to select a 

is catalytic RNA having a specific ribonuclease activity 
from a pool of RNA molecules. See, e.g., Bartel and 
Szostak (1993) Science 261:1411-1418. 

The invention also encompasses nucleic acid 
molecules which form triple helical structures. For 

20 example, Tango- 77 gene expression can be inhibited by 
targeting nucleotide sequences complementary to the 
regulatory region of the Tango-77 (e.g., the Tango-77 
promoter and/or enhancers) to form triple helical 
structures that prevent transcription of the Tango-77 

25 gene in target cells. See generally, Helene (1991) 

Anticancer Drug Des. 6 (6) :569-84; Helene (1992) Ann. N.Y. 
Acad. Sci. 660:27-36; and Maher (1992) Bioassays 
14 (12) :807-15. 

In preferred embodiments, the nucleic acid 

30 molecules of the invention can be modified at the base 
moiety, sugar moiety or phosphate backbone to improve, 
e.g., the stability, hybridization, or solubility of the 
molecule. For example, the deoxyribose phosphate 
backbone of the nucleic acids can be modified to generate 

35 peptide nucleic acids (see Hyrup et al. (1996) Bioorganic 
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& Medicinal Chemistry 4 (1) : 5-23). As used herein, the 
terms "peptide nucleic acids" or "PNAs" refer to nucleic 
acid mimics, e.g., DNA mimics, in which the deoxyribose 
phosphate backbone is replaced by a pseudopeptide 
backbone and only the four natural nucleobases are 
retained. The neutral backbone of PNAs has been shown to 
allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. The synthesis of PNA 
oligomers can be performed using standard solid phase 
peptide synthesis protocols as described in Hyrup et al . 
(1996) supra; Perry-O' Keef e et al . (1996) Proc. Natl. 
Acad. Sci. USA 93: 14670-675. 

PNAs of Tango- 77 can be used in therapeutic and 
diagnostic applications. For example, PNAs can be used 
as antisense or antigene agents for sequence-specific 
modulation of gene expression by, e.g., inducing 
transcription or translation arrest or inhibiting 
replication. PNAs of Tango-77 can also be used, e.g., in 
the analysis of single base pair mutations in a gene by, 
e.g., PNA directed PCR clamping; as artificial 
restriction enzymes when used in combination with other 
enzymes, e.g., SI nucleases (Hyrup (1996) supra; or as 
probes or primers for DNA sequence and hybridization 
(Hyrup (1996) supra; Perry-O' Keef e et al . (1996) Proc. 
Natl. Acad. Sci. USA 93: 14670-675). 

In another embodiment, PNAs of Tango-77 can be 
modified, e.g., to enhance their stability or cellular 
uptake, by attaching lipophilic or other helper groups to 
PNA, by the formation of PNA -DNA chimeras, or by the use 
of liposomes or other techniques of drug delivery known 
in the art. For example, PNA -DNA chimeras of Tango-77 
can be generated which may combine the advantageous 
properties of PNA and DNA. Such chimeras allow DNA 
recognition enzymes, e.g., RNAse H and DNA polymerases, 
to interact with the DNA portion while the PNA portion 
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would provide high binding affinity and specificity. 
PNA-DNA chimeras can be linked using linkers of 
appropriate lengths selected in terms of base stacking, 
number of bonds between the nucleobases, and orientation 
5 (Hyrup (1996) supra) . The synthesis of PNA-DNA chimeras 
can be performed as described in Hyrup (1996) supra and 
Finn et al . (1996) Nucleic Acids Res. 24 (17) : 3357-63 . 
For example , a DNA chain can be synthesized on a solid 
support using standard phosphoramidite coupling chemistry 

10 and modified nucleoside analogs. Compounds such as 5'- 
(4-methoxytrityl) amino-5' -deoxy- thymidine phosphoramidite 
can be used as a link between the PNA and the 5' end of 
DNA (Mag et al . (1989) Nucleic Acid Res. 17:5973-88). 
PNA monomers are then coupled in a stepwise manner to 

is produce a chimeric molecule with a 5' PNA segment and a 
3' DNA segment (Finn et al. (1996) Nucleic Acids Res. 
24 (17) :3357-63) . Alternatively, chimeric molecules can 
be synthesized with a 5' DNA segment and a 3' PNA segment 
(Peterser et al. (1975) Bioorganic Med. Chem. Lett. 

20 5:1119-11124). 

In other embodiments, the oligonucleotide may 
include other appended groups such as peptides (e.g., for 
targeting host cell receptors in vivo) , or agents 
facilitating transport across the cell membrane (see, 

25 e.g., Letsinger et al. (1989) Proc. Natl. Acad. Sci. USA 
86:6553-6556; Lemaitre et al . (1987) Proc. Natl. Acad. 
Sci. USA 84:648-652; PCT Publication No. WO 88/09810) or 
the blood-brain barrier (see, e.g., PCT Publication No. 
W0 89/10134) . In addition, oligonucleotides can be 

30 modified with hybridization- triggered cleavage agents 
(see, e.g., Krol et al . (1988) Bio/rechnigues 6:958-976) 
or intercalating agents (see, e.g., Zon (1988) Pharm. 
Res. 5:539-549). To this end, the oligonucleotide may be 
conjugated to another molecule, e.g., a peptide, 
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hybridization triggered cross-linking agent, transport 
agent, hybridization- triggered cleavage agent, etc. 

II. Isolated Tango-77 Proteins and Anti-Tancro*-77 
Antibodies 

One aspect of the invention pertains to isolated 
Tango- 77 proteins, and biologically active portions 
thereof, as well as polypeptide fragments suitable for 
use as immunogens to raise anti-Tango-77 antibodies. In 
one embodiment, native Tango- 77 proteins can be isolated 
from cells or tissue sources by an appropriate 
purification scheme using standard protein purification 
techniques. In another embodiment, Tango-77 proteins are 
produced by recombinant DNA techniques. Alternative to 
recombinant expression, a Tango-77 protein or polypeptide 
can be synthesized chemically using standard peptide 
synthesis techniques. 

An "isolated" or "purified" protein or 
biologically active portion thereof is substantially free 
of cellular material or other contaminating proteins from 
the cell or tissue source from which the Tango-77 protein 
is derived, or substantially free of chemical precursors 
or other chemicals when chemically synthesized. The 
language "substantially free of cellular material 1 ' 
includes preparations of Tango-77 protein in which the 
protein is separated from cellular components of the 
cells from which it is isolated or recombinantly 
produced. Thus, Tango-77 protein that is substantially 
free of cellular material includes preparations of 
Tango-77 protein having less than about 30%, 20%, 10%, or 
5% (by dry weight) of non-Tango-77 protein (also referred 
to herein as a "contaminating protein") . When the 
Tango-77 protein or biologically active portion thereof 
is recombinantly produced, it is also preferably 
substantially free of culture medium, i.e., culture 
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medium represents less than about 20%, 10%, or 5% of the 
volume of the protein preparation. When Tango- 77 protein 
is produced by chemical synthesis, it is preferably 
substantially free of chemical precursors or other 
5 chemicals, i.e., it is separated from chemical precursors 
or other chemicals which are involved in the synthesis of 
the protein. Accordingly such preparations of Tango-77 
protein have less than about 30%, 20%, 10%, 5% (by dry 
weight) of chemical precursors or non-Tango-77 chemicals. 

io Biologically active portions of a Tango-77 protein 

include peptides comprising amino acid sequences 
sufficiently identical to or derived from the amino acid 
sequence of the Tango-77 protein (e.g., the amino acid 
sequence shown in SEQ ID NO:2, SEQ ID NO: 5, SEQ ID NO: 7, 

15 SEQ ID NO:9, SEQ ID N0:11, or SEQ ID NO:13), which 

include fewer amino acids than the full length Tango-77 
proteins, and exhibit at least one activity of a Tango-77 
protein. Typically, biologically active portions 
comprise a domain or motif with at least one activity of 

20 the Tango-77 protein. A biologically active portion of a 
Tango-77 protein can be a polypeptide which is, for 
example, 10, 25, 50, 100 or more amino acids in length. 

Moreover, other biologically active portions, in 
which other regions of the protein are deleted, can be 

25 prepared by recombinant techniques and evaluated for one 
or more of the functional activities of a native Tango-77 
protein. 

Preferred Tango-77 protein has the amino acid 
sequence shown of SEQ ID NO: 2, SEQ ID NO: 5, SEQ ID NO: 7, 
30 SEQ ID NO: 9, SEQ ID NO: 11, or SEQ ID NO: 13. Other useful 
Tango-77 proteins are substantially identical to SEQ ID 

NO:2, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID 
NO: 11, or SEQ ID NO: 13 and retain the functional activity 
of the protein of SEQ ID NO: 2, SEQ ID NO: 5, SEQ ID NO: 7, 
35 SEQ ID NO: 9, SEQ ID NO: 11, or SEQ ID NO: 13 yet differ in 
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amino acid sequence due to natural allelic variation or 
mutagenesis. Accordingly, a useful Tango-77 protein is a 
protein which includes an amino acid sequence at least 
about 45%, preferably 55%, 65%, 75%, 85%, 95%, or 99% 
5 identical to the amino acid sequence of SEQ ID NO: 2, SEQ 
ID NO: 5, SEQ ID NO : 7 , SEQ ID NO: 9, SEQ ID NO: 11, or SEQ 

ID NO: 13 and retains the functional activity of the 
Tango-77 proteins of SEQ ID NO: 2, SEQ ID NO: 5, SEQ ID 
NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, or SEQ ID NO: 13. In a 

io preferred embodiment, the Tango-77 protein retains a 
functional activity of the Tango-77 protein of SEQ ID 
NO: 2, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID 
NO: 11, or SEQ ID NO: 13. 

To determine the percent identity of two amino 

is acid sequences or of two nucleic acids, the sequences are 
aligned for optimal comparison purposes (e.g., gaps can 
be introduced in the sequence of a first amino acid or 
nucleic acid sequence for optimal alignment with a second 
amino or nucleic acid sequence) . The amino acid residues 

20 or nucleotides at corresponding amino acid positions or 
nucleotide positions are then compared. When a position 
in the first sequence is occupied by the same amino acid 
residue or nucleotide as the corresponding position in 
the second sequence, then the molecules are identical at 

25 that position. The percent identity between the two 
sequences is a function of the number of identical 
positions shared by the sequences (i.e., % identity = # 
of identical positions/total # of positions, e.g., 
overlapping x 100) . Preferably, the two sequences are 

30 the same length. 

The determination of percent homology between two 
sequences can be accomplished using a mathematical 
algorithm. A preferred, non- limiting example of a 
mathematical algorithm utilized for the comparison of two 

35 sequences is the algorithm of Karlin and Altschul (1990) 
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Proc. Natl. Acad. Sci. USA 87:2264-2268, modified as in 
Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 
90:5873-5877. Such an algorithm is incorporated into the 
NBLAST and XBLAST programs of Altschul, et al . (1990) 
5 J. Mol. Biol. 215:403-410. BLAST nucleotide searches can 
be performed with the NBLAST program, score = 100, 
wordlength = 12 to obtain nucleotide sequences homologous 
to Tango- 77 nucleic acid molecules of the invention. 
BLAST protein searches can be performed with the XBLAST 

10 program, score = 50, wordlength = 3 to obtain amino acid 
sequences homologous to Tango- 77 protein molecules of the 
invention. To obtain gapped alignments for comparison 
purposes, Gapped BLAST can be utilized as described in 
Altschul et al. (1997) Nucleic Acids Res. 25:3389-3402. 

is When utilizing BLAST and Gapped BLAST programs, the 
default parameters of the respective programs (e.g., 
XBLAST and NBLAST) can be used. See 

http://www.ncbi.nlm.nih.gov. Another preferred, non- 
limiting example of a mathematical algorithm utilized for 

20 the comparison of sequences is the algorithm of Myers and 
Miller, CABIOS (1989) . Such an algorithm is incorporated 
into the ALIGN program (version 2.0) which is part of the 
GCG sequence alignment software package. When utilizing 
the ALIGN program for comparing amino acid sequences, a 

25 PAM120 weight residue table, a gap length penalty of 12, 
and a gap penalty of 4 can be used. 

The percent identity between two sequences can be 
determined using techniques similar to those described 
above, with or without allowing gaps. In calculating 

30 percent identity, only exact matches are counted. 

The invention also provides Tango- 77 chimeric or 
fusion proteins. As used herein, a Tango-77 "chimeric 
protein" or "fusion protein" comprises a Tango-77 
polypeptide operably linked to a non-Tango-77 

35 polypeptide. A "Tango-77 polypeptide" refers to a 



WO 99/06426 PCT/US98/1 61 02 

- 31 - 

polypeptide having an amino acid sequence corresponding 
to Tango-77 polypeptides, whereas a "non- Tango- 77 
polypeptide" refers to a polypeptide having an amino acid 
sequence corresponding to a protein which is not 
substantially identical to the Tango-77 protein, e.g., a 
protein which is different from the Tango-77 protein and 
which is derived from the same or a different organism. 
Within a Tango-77 fusion protein the Tango-77 polypeptide 
can correspond to all or a portion of a Tango-77 protein, 
preferably at least one biologically active portion of a 
Tango-77 protein. Within the fusion protein, the term 
"operably linked" is intended to indicate that the 
Tango-77 polypeptide and the non- Tango- 77 polypeptide are 
fused in-frame to each other. The non-Tango- 77 
polypeptide can be fused to the N-terminus or C-terminus 
of the Tango-77 polypeptide. 

One useful fusion protein is a GST-Tango- 77 fusion 
protein in which the Tango-77 sequences are fused to the 
C-terminus of the GST sequences. Such fusion proteins 
can facilitate the purification of recombinant Tango-77. 

In another embodiment, the fusion protein is a 
Tango-77 protein containing a heterologous signal 
sequence at its N-terminus. For example, the native 
Tango-77 signal sequence (i.e., about amino acids 1 to 63 
of SEQ ID NO:2; SEQ ID NO:4; or about amino acids 1 to 52 
of SEQ ID NO:7;SEQ ID NO: 8; or about amino acids 1 to 21 
of SEQ ID NO: 11; SEQ ID NO: 12) can be removed arid replaced 
with a signal sequence from another protein. In certain 
host cells (e.g., mammalian host cells), expression 
and/ or secretion of Tango-77 can be increased through use 
of a heterologous signal sequence. For example, the gp67 
secretory sequence of the baculovirus envelope protein 
can be used as a heterologous signal sequence (Ausubel et 
al., supra). Other examples of eukaryotic heterologous 
signal sequences include the secretory sequences of 
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melittin and human placental alkaline phosphatase 
(Stratagene; La Jolla, California) . In yet another 
example, useful prokaryotic heterologous signal sequences 
include the phoA secretory signal (Sambrook et al., 
5 supra) and the protein A secretory signal (Pharmacia 
Biotech; Piscataway, New Jersey) . 

In yet another embodiment, the fusion protein is 
an Tango- 77 -immunoglobulin fusion protein in which all or 
part of Tango- 77 is fused to sequences derived from a 

10 member of the immunoglobulin protein family. The 

Tango- 77 -immunoglobulin fusion proteins of the invention 
can be incorporated into pharmaceutical compositions and 
administered to a subject to inhibit an interaction 
between a Tango- 77 ligand and a Tango- 77 receptor on the 

is surface of a cell, to thereby suppress Tango- 77 -mediated 
signal transduction in vivo. The Tango- 77- immunoglobulin 
fusion proteins can be used to affect the bioavailability 
of a Tango-77 cognate ligand. Inhibition of the Tango-77 
ligand/Tango-77 interaction may be useful therapeutically 

20 for both the treatment of inflammatory and autoimmune 
disorders. Moreover, the Tango-77- immunoglobulin fusion 
proteins of the invention can be used as immunogens to 
produce anti-Tango-77 antibodies in a subject, to purify 
Tango-77 ligands and in screening assays to identify 

25 molecules which inhibit the interaction of Tango-77 with 
a Tango-77 receptor. 

Preferably, a Tango-77 chimeric or fusion protein 
of the invention is produced by standard recombinant DNA 
techniques. For example, DNA fragments coding for the 

30 different polypeptide sequences are ligated together in- 
frame in accordance with conventional techniques, for 
example by employing blunt -ended or stagger-ended termini 
for ligation, restriction enzyme digestion to provide for 
appropriate termini, filling-in of cohesive ends as 

35 appropriate, alkaline phosphatase treatment to avoid 
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undesirable joining, and enzymatic ligation. In another 
embodiment, the fusion gene can be synthesized by 
conventional techniques including automated DNA 
synthesizers. Alternatively, PCR amplification of gene 
5 fragments can be carried out using anchor primers which 
give rise to complementary overhangs between two 
consecutive gene fragments which can subsequently be 
annealed and reamplified to generate a chimeric gene 
sequence (see, e.g., Current Protocols in Molecular 

10 Biology, Ausubel et al . eds . , John Wiley & Sons: 1992). 
Moreover, many expression vectors are commercially 
available that already encode a fusion moiety (e.g., a 
GST polypeptide) . An Tango- 77 -encoding nucleic acid can 
be cloned into such an expression vector such that the 

15 fusion moiety is linked in- frame to the Tango- 77 protein. 

The present invention also pertains to variants of 
the Tango-77 proteins (i.e., proteins having a sequence 
which differs from that of the Tango-77 amino acid 
sequence) . Such variants can function as either Tango-77 

20 agonists (mimetics) or as Tango-77 antagonists. Variants 
of the Tango-77 protein can be generated by mutagenesis, 
e.g., discrete point mutation or truncation of the 
Tango-77 protein. An agonist of the Tango-77 protein can 
retain substantially the same, or a subset, of the 

25 biological activities of the naturally occurring form of 
the Tango-77 protein. An antagonist of the Tango-77 
protein can inhibit one or more of the activities of the 
naturally occurring form of the Tango-77 protein by, for 
example, competitively binding to a downstream or 

30 upstream member of a cellular signaling cascade which 

includes the Tango-77 protein. Thus, specific biological 
effects can be elicited by treatment with a variant of 
limited function. Treatment of a subject with a variant 
having a. subset of the biological activities of the 

35 naturally occurring form of the protein can have fewer 
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side effects in a subject relative to treatment with the 
naturally occurring form of the Tango-77 proteins. 

Variants of the Tango-77 protein which function as 
either Tango-77 agonists (mimetics) or as Tango-77 
5 antagonists can be identified by screening combinatorial 
libraries of mutants, e.g., truncation mutants, of the 
Tango-77 protein for Tango-77 protein agonist or 
antagonist activity. In one embodiment, a variegated 
library of Tango-77 variants is generated by 

10 combinatorial mutagenesis at the nucleic acid level and 
is encoded by a variegated gene library. A variegated 
library of Tango-77 variants can be produced by, for 
example, enzymatically ligating a mixture of synthetic 
oligonucleotides into gene sequences such that a 

is degenerate set of potential Tango-77 sequences is 

expressible as individual polypeptides, or alternatively, 
as a set of larger fusion proteins (e.g., for phage 
display) containing the set of Tango-77 sequences 
therein. There are a variety of methods which can be 

20 used to produce libraries of potential Tango-77 variants 
from a degenerate oligonucleotide sequence. Chemical 
synthesis of a degenerate gene sequence can be performed 
in an automatic DNA synthesizer, and the synthetic gene 
then ligated into an appropriate expression vector. Use 

25 of a degenerate set of genes allows for the provision, in 
one mixture, of all of the sequences encoding the desired 
set of potential Tango-77 sequences. Methods for 
synthesizing degenerate oligonucleotides are known in the 
art {see, e.g., Narang (1983) Tetrahedron 39:3; Itakura 

30 et al . (1984) Annu. .Rev. Biochem. 53:323; Itakura et al . 
(1984) Science 198:1056; Ike et al . (1983) Nucleic Acid 
Res. 11:477) . 

In addition, libraries of fragments of the 
Tango-77 protein coding sequence can be used to generate 

35 a variegated population of Tango-77 fragments for 



WO 99/06426 



PCTYUS98/16102 



- 35 - 

screening and subsequent selection of variants of a 
Tango- 77 protein. In one embodiment, a library of coding 
sequence fragments can be generated by treating a double 
stranded PCR fragment of a Tango- 77 coding sequence with 
a nuclease under conditions wherein nicking occurs only 
about once per molecule, denaturing the double stranded 
DNA, renaturing the DNA to form double stranded DNA which 
can include sense/antisense pairs from different nicked 
products, removing single stranded portions from reformed 
duplexes by treatment with SI nuclease, and ligating the 
resulting fragment library into an expression vector. By 
this method, an expression library can be derived which 
encodes N-terminal and internal fragments of various 
sizes of the Tango-77 protein. 

Several techniques are known in the art for 
screening gene products of combinatorial libraries made 
by point mutations or truncation, and for screening cDNA 
libraries for gene products having a selected property. 
Such techniques are adaptable for rapid screening of the 
gene libraries generated by the combinatorial mutagenesis 
of Tango-77 proteins. The most widely used techniques, 
which are amenable to high through-put analysis, for 
screening large gene libraries typically include cloning 
the gene library into replicable expression vectors, 
transforming appropriate cells with the resulting library 
of vectors, and expressing the combinatorial genes under 
conditions in which detection of a desired activity 
facilitates isolation of the vector encoding the gene 
whose product was detected. Recursive ensemble 
mutagenesis (REM) , a technique which enhances the 
frequency of functional mutants in the libraries, can be 
used in combination with the screening assays to identify 
Tango-77 variants (Arkin and Yourvan (1992) Proc. Natl. 
Acad. Sci. USA 89:7811-7815; Delgrave et al . (1993) 
Protein Engineering 6(3) : 327-331) . 
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An isolated Tango-77 protein, or a portion or 
fragment thereof, can be used as an immunogen to generate 
antibodies that bind Tango-77 using standard techniques 
for polyclonal and monoclonal antibody preparation. The 
5 full-length Tango-77 protein can be used or, 

alternatively, the invention provides antigenic peptide 
fragments of Tango-77 for use as immunogens. The 
antigenic peptide of Tango-77 comprises at least 8 
(preferably 10, 15, 20, or 30) amino acid residues of the 

10 amino acid sequence shown in SEQ ID NO: 2, SEQ ID NO: 5, 
SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11 or SEQ ID NO: 13 
and encompasses an epitope of Tango-77 such that an 
antibody raised against the peptide forms a specific 
immune complex with Tango-77. 

15 A Tango-77 immunogen typically is used to prepare 

antibodies by immunizing a suitable subject (e.g., 
rabbit, goat, mouse or other mammal) with the immunogen. 
An appropriate immunogenic preparation can contain, for 
example, recombinant ly expressed Tango-77 protein or a 

20 chemically synthesized Tango-77 polypeptide. The 
preparation can further include an adjuvant, such as 
Freund's complete or incomplete adjuvant, or similar 
immunostimulatory agent. Immunization of a suitable 
subject with an immunogenic Tango-77 preparation induces 

25 a polyclonal anti -Tango- 77 antibody response. 

Accordingly, another aspect of the invention 
pertains to anti-Tango-77 antibodies. The term 
"antibody" as used herein refers to immunoglobulin 
molecules and immunologically active portions of 

30 immunoglobulin molecules, i.e., molecules that contain an 
antigen binding site which specifically binds an antigen, 
such as Tango-77. A molecule which specifically binds to 
Tango-77 is a molecule which binds Tango-77, but does not 
substantially bind other molecules in a sample, e.g., a 

35 biological sample, which naturally contains Tango-77. 
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Examples of immunologically active portions of 
immunoglobulin molecules include F(ab) and F(ab') 2 
fragments which can be generated by treating the antibody 
with an enzyme such as pepsin. The invention provides 
5 polyclonal and monoclonal antibodies that bind Tango- 77. 
The term "monoclonal antibody" or "monoclonal antibody 
composition", as used herein, refers to a population of 
antibody molecules that contain only one species of an 
antigen binding site capable of immunoreacting with a 

10 particular epitope of Tango-77. A monoclonal antibody 
composition thus typically displays a single binding 
affinity for a particular Tango-77 protein with which it 
immunoreacts . 

Polyclonal ant i -Tango-77 antibodies can be 

is prepared as described above by immunizing a suitable 
subject with a Tango-77 immunogen. The anti-Tango-77 
antibody titer in the immunized subject can be monitored 
over time by standard techniques, such as with an enzyme 
linked immunosorbent assay (ELISA) using immobilized 

20 Tango-77. If desired, the antibody molecules directed 
against Tango-77 can be isolated from the mammal (e.g., 
from the blood) and further purified by well-known 
techniques, such as protein A chromatography to obtain 
the IgG fraction. At an appropriate time after 

25 immunization, e.g., when the anti-Tango-77 antibody 
titers are highest, antibody-producing cells can be 
obtained from the subject and used to prepare monoclonal 
antibodies by standard techniques, such as the hybridoma 
technique originally described by Kohler and Milstein 

30 (1975) Nature 256:495-497, the human B cell hybridoma 
technique (Kozbor et al . (1983) Immunol Today 4:72), the 
EBV-hybridoma technique (Cole et al . (1985), Monoclonal 
Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 
77-96) or trioma techniques. The technology for 

35 producing hybridomas is well known (see generally Current 
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Protocols in Immunology (1994) Coligan et al . (eds.) John 
Wiley Sc Sons, Inc., New York, NY). Briefly, an immortal 
cell line (typically a myeloma) is fused to lymphocytes 
(typically splenocytes) from a mammal immunized with a 
5 Tango- 77 immunogen as described above, and the culture 
supernatants of the resulting hybridoma cells are 
screened to identify a hybridoma producing a monoclonal 
antibody that binds Tango- 77. 

Any of the many well known protocols used for 

10 fusing lymphocytes and immortalized cell lines can be 
applied for the purpose of generating an anti-Tango-77 
monoclonal antibody (see, e.g., Current Protocols in 
Immunology, supra; Galfre et al . (1977) Nature 266:55052; 
R.H. Kenneth, in Monoclonal Antibodies: A New Dimension 

15 In Biological Analyses, Plenum Publishing Corp., New 
York, New York (1980); and Lerner (1981) Yale J*. Biol. 
Med,, 54:387-402. Moreover, the ordinarily skilled 
worker will appreciate that there are many variations of 
such methods which also would be useful. Typically, the 

20 immortal cell line (e.g., a myeloma cell line) is derived 
from the same mammalian species as the lymphocytes. For 
example, murine hybridomas can be made by fusing 
lymphocytes from a mouse immunized with an immunogenic 
preparation of the present invention with an immortalized 

25 mouse cell line, e.g., a myeloma cell line that is 
sensitive to culture medium containing hypoxanthine, 
aminopterin and thymidine ("HAT medium") . Any of a 
number of myeloma cell lines can be used as a fusion 
partner according to standard techniques, e.g., the P3- 

30 NSl/l-Ag4-l, P3-x63-Ag8.653 or Sp2/0-Agl4 myeloma lines. 
These myeloma lines are available from ATCC. Typically, 
HAT-sensitive mouse myeloma cells are fused to mouse 
splenocytes using polyethylene glycol ("PEG"). Hybridoma 
cells resulting from the fusion are then selected using 

35 HAT medium, which kills unfused and unproductively fused 
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myeloma cells (unfused splenocytes die after several days 
because they are not transformed) . Hybridoma cells 
producing a monoclonal antibody of the invention are 
detected by screening the hybridoma culture supernatant s 
for antibodies that bind Tango-77, e.g., using a standard 
ELISA assay. 

Alternative to preparing monoclonal antibody- 
secreting hybridomas, a monoclonal anti-Tango-77 antibody 
can be identified and isolated by screening a recombinant 
combinatorial immunoglobulin library (e.g., an antibody 
phage display library) with Tango-77 to thereby isolate 
immunoglobulin library members that bind Tango-77. Kits 
for generating and screening phage display libraries are 
commercially available (e.g., the Pharmacia Recombinant 
Phage Antibody System, Catalog No. 27-94 00-01; and the 
Stratagene Surf ZAP™ Phage Display Kit, Catalog No. 
240612) . Additionally, examples of methods and reagents 
particularly amenable for use in generating and screening 
antibody display library can be found in, for example, 
U.S. Patent No. 5,223,409; PCT Publication No. WO 
92/18619; PCT Publication No. WO 91/17271; PCT 
Publication No. WO 92/20791; PCT Publication No. WO 
92/15679; PCT Publication No. WO 93/01288; PCT 
Publication No. WO 92/01047; PCT Publication No. WO 
92/09690; PCT Publication No. WO 90/02809; Fuchs et al . 
(1991) Bio/Technology 9:1370-1372; Hay et al . (1992) Hum. 
Antibod. Hybridomas 3:81-85; Huse et al . (1989) Science 
246:1275-1281; Griffiths et al . (1993) EMBO J 12:725-734. 

Additionally, recombinant anti-Tango-77 
antibodies, such as chimeric and humanized monoclonal 
antibodies, comprising both human and non-human portions, 
which can be made using standard recombinant DNA 
techniques, are within the scope of the invention. Such 
chimeric and humanized monoclonal antibodies can be 
produced by recombinant DNA techniques known in the art, 
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for example using methods described in PCT Publication 
No. WO 87/02671; European Patent Application 184,187; 
European Patent Application 171,496; European Patent 
Application 173,494; PCT Publication No. WO 86/01533; 
5 U.S. Patent No. 4,816,567; European Patent Application 
125,023; Better et al . (1988) Science 240:1041-1043; Liu 
et al. (1987) Proc. Natl. Acad. Sci. USA 84:3439-3443; 
Liu et al. (1987) J*. Immunol. 139:3521-3526; Sun et al . 
(1987) Proc. Natl. Acad. Sci. USA 84:214-218; Nishimura 

10 et al. (1987) Cane. Res. 47:999-1005; Wood et al . (1985) 
Nature 314:446-449; and Shaw et al . (1988) J. Natl. 
Cancer Inst. 80:1553-1559); Morrison (1985) Science 
229:1202-1207; Oi et al . (1986) Bio/Techniques 4:214; 
U.S. Patent 5,225,539; Jones et al . (1986) Nature 

15 321:552-525; Verhoeyan et al. (1988) Science 239:1534; 
and Beidler et al . (1988) J. Immunol. 141:4053-4060. 

Completely human antibodies are particularly 
desirable for therapeutic treatment of human patients. 
Such antibodies can be produced using transgenic mice 

20 which are incapable of expressing endogenous 

immunoglobulin heavy and light chains genes, but which 
can express human heave and light chain genes. The 
transgenic mice are immunized in the normal fashion with 
a selected antigen, e.g., all or a portion of Tango-77. 

25 Monoclonal antibodies directed against the antigen can be 
obtained using conventional hybridoma technology. The 
human immunoglobulin transgenes harbored by the 
transgenic mice rearrange during B cell differentiation, 
and subsequently undergo class switching and somatic 

30 mutation. Thus, using such a technique, it is possible 
to produce therapeutically useful IgG, IgA and IgE 
antibodies. For an overview of this technology for 
producing human antibodies, see Lonberg and Huszar (1995, 
Int. Rev. Immunol. 13:65-93). For a detailed discussion 

35 of this technology for producing human antibodies and 
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human monoclonal antibodies and protocols for producing 
such antibodies, see, e.g., U.S. Patent 5,625,126; U.S. 
Patent 5,633,425; U.S. Patent 5,569,825; U.S. Patent 
5,661,016; and U.S. Patent 5,545,806. In addition, 
companies such as Abgenix, Inc. (Freemont, CA) , can be 
engaged to provide human antibodies directed against a 
selected antigen using technology similar to the 
described above. 

Completely human antibodies which recognize a 
selected epitope can be generated using a technique 
referred to as "guided selection." In this approach a 
selected non-human monoclonal antibody, e.g., a murine 
antibody, is used to guide the selection of a completely 
human antibody recognizing the same epitope. 

First, a non-human monoclonal antibody which binds 
a selected antigen (epitope), e.g., an antibody which 
inhibits Tango-77 activity, is identified. The heave 
chain and the light chain of the non-human antibody are 
cloned and used to create phage display Fab fragments. 
For example, the heave chain gene can be cloned into a 
plasmid vector so that the heavy chain can be secreted 
from bacteria. The light chain gene can be cloned into a 
phage coat protein gene so that the light chain can be 
expressed on the surface of phage. A repertoire (random 
collection) of human light chains fused to phage is used 
to infect the bacteria which express the non-human heavy 
chain. The resulting progeny phage display hybrid 
antibodies (human light chain/non-human heavy chain) . 
The selected antigen is used in a panning screen to 
select phage which bind the selected antigen. Several 
rounds of selection may be required to identify such 
phage. Next, human light chain genes are isolated from 
the selected phage which bind the selected antigen. 
These selected human light chain genes are then used to 
guide the selection of human heavy chain genes as 
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follows. The selected human light chain genes are 
inserted into vectors for expression by bacteria. 
Bacteria expressing the selected human light chains are 
infected with a repertoire of human heavy chains fused to 
5 phage. The resulting progeny phage display human 
antibodies (human light chain/human heavy chain) . 

Next, the selected antigen is used in a panning 
screen to select phage which bind the selected antigen. 
The phage selected in this step display completely human 

10 antibody which recognize the same epitope recognized by 
the original selected, non-human monoclonal antibody. 
The genes encoding both the heavy and light chains are 
readily isolated and be further manipulated for 
production of human antibody. This technology is 

15 described by Jespers et al. (1994, Bio/ technology 12:899- 
903) . 

An anti -Tango- 77 antibody (e.g., monoclonal 
antibody) can be used to isolate Tango-77 by standard 
techniques, such as affinity chromatography or 

20 immunoprecipitation. An anti-Tango-77 antibody can 
facilitate the purification of natural Tango-77 from 
cells and of recombinant ly produced Tango-77 expressed in 
host cells. Moreover, an anti-Tango-77 antibody can be 
used to detect Tango-77 protein (e.g., in a cellular 

25 lysate or cell supernatant) in order to evaluate the 
abundance and pattern of expression of the Tango-77 
protein. Anti-Tango-77 antibodies can be used 
diagnostically to monitor protein levels in tissue as 
part of a clinical testing procedure, e.g., to, for 

30 example, determine the efficacy of a given treatment 
regimen. Detection can be facilitated by coupling the 
antibody to a detectable substance. Examples of 
detectable substances include various enzymes, prosthetic 
groups, fluorescent materials, luminescent materials, 

35 biolumine scent materials, and radioactive materials. 
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Examples of suitable enzymes include horseradish 
peroxidase, alkaline phosphatase, /3-galactosidase, or 
acetylcholinesterase; examples of suitable prosthetic 
group complexes include streptavidin/biotin and 
5 avidin/biotin; examples of suitable fluorescent materials 
include umbellif erone, fluorescein, fluorescein 
isothiocyanate, rhodamine , dichlorotriazinylamine 
fluorescein, dansyl chloride or phycoerythrin; an example 
of a luminescent material includes luminol ; examples of 
10 bioluminescent materials include lucif erase, luciferin, 
and aequorin, and examples of suitable radioactive 
material include 125 1 , 131 I, 35 S or 3 H. 

Ill, Recombinant Expression Vectors and Host Cells 
Another aspect of the invention pertains to 

is vectors, preferably expression vectors, containing a 
nucleic acid molecule encoding Tango-77 (or a portion 
thereof) . As used herein, the term "vector" refers to a 
nucleic acid molecule capable of transporting another 
nucleic acid to which it has been linked- One type of 

20 vector is a "plasmid", which refers to a circular double 
stranded DNA loop into which additional DNA segments can 
be ligated. Another type of vector is a viral vector, 
wherein additional DNA segments can be ligated into the 
viral genome. Certain vectors are capable of autonomous 

25 replication in a host cell into which they are introduced 
(e.g., bacterial vectors having a bacterial origin of 
replication and episomal mammalian vectors) . Other 
vectors (e.g., non-episomal mammalian vectors) are 
integrated into the genome of a host cell upon 

30 introduction into the host cell, and thereby are 

replicated along with the host genome. Moreover, certain 
vectors, expression vectors, are capable of directing the 
expression of genes to which they are operably linked. 
In general, expression vectors of utility in recombinant 
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DNA techniques are often in the form of plasmids 
(vectors) . However, the invention is intended to include 
such other forms of expression vectors, such as viral 
vectors (e.g., replication defective retroviruses, 
5 adenoviruses and adeno-associated viruses) , which serve 
equivalent functions . 

The recombinant expression vectors of the 
invention comprise a nucleic acid of the invention in a 
form suitable for expression of the nucleic acid in a 

10 host cell, which means that the recombinant expression 
vectors include one or more regulatory sequences, 
selected on the basis of the host cells to be used for 
expression, which is operably linked to the nucleic acid 
sequence to be expressed- Within a recombinant 

is expression vector, "operably linked" is intended to mean 
that the nucleotide sequence of interest is linked to the 
regulatory sequence (s) in a manner which allows for 
expression of the nucleotide sequence (e.g., in an in 
vitro transcription/translation system or in a host cell 

20 when the vector is introduced into the host cell) . The 
term "regulatory sequence" is intended to include 
promoters, enhancers and other expression control 
elements (e.g., polyadenylation signals). Such 
regulatory sequences are described, for example, in 

25 Goeddel; Gene Expression Technology: Methods in 

Enzymology 185, Academic Press, San Diego, CA (1990) . 
Regulatory sequences include those which direct 
constitutive expression of a nucleotide sequence in many 
types of host cell and those which direct expression of 

30 the nucleotide sequence only in certain host cells (e.g., 
tissue-specific regulatory sequences) . It will be 
appreciated by those skilled in the art that the design 
of the expression vector can depend on such factors as 
the choice of the host cell to be transformed, the level 

35 of expression of protein desired, etc. The expression 
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vectors of the invention can be introduced into host 
cells to thereby produce proteins or peptides, including 
fusion proteins or peptides, encoded by nucleic acids as 
described herein (e.g., Tango- 77 proteins, mutant forms 
5 of Tango-77, fusion proteins, etc.) . 

The recombinant expression vectors of the 
invention can be designed for expression of Tango-77 in 
prokaryotic or eukaryotic cells, e.g., bacterial cells 
such as E. coli, insect cells (using baculovirus 

10 expression vectors), yeast cells or mammalian cells. 
Suitable host cells are discussed further in Goeddel, 
Gene Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, CA (1990) . Alternatively, the 
recombinant expression vector can be transcribed and 

is translated in vitro, for example using T7 promoter 
regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most 
often carried out in E. coli with vectors containing 
constitutive or inducible promoters directing the 

20 expression of either fusion or non- fusion proteins. 

Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the 
recombinant protein. Such fusion vectors typically serve 
three purposes: 1) to increase expression of recombinant 

25 protein; 2) to increase the solubility of the recombinant 
protein; and 3) to aid in the purification of the 
recombinant protein by acting as a ligand in affinity 
purification. Often, in fusion expression vectors, a 
proteolytic cleavage site is introduced at the junction 

30 of the fusion moiety and the recombinant protein to 
enable separation of the recombinant protein from the 
fusion moiety subsequent to purification of the fusion 
protein. Such enzymes, and their cognate recognition 
sequences, include Factor Xa, thrombin and enterokinase . 

35 Typical fusion expression vectors include pGEX (Pharmacia 
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Biotech Inc; Smith and Johnson (1988) Gene 67:31-40), 
pMAL (New England Biolabs, Beverly, MA) and pRIT5 
(Pharmacia, Piscataway, NJ) which fuse glutathione S- 
transf erase (GST) , maltose E binding protein, or protein 
A, respectively, to the target recombinant protein. 

Examples of suitable inducible non- fusion E. coli 
expression vectors include pTrc (Amann et al . (1988) Gene 
69:301-315) and pET lid (Studier et al . , Gene Expression 
Technology: Methods in Enzymology 185, Academic Press, 
San Diego, California (1990) 60-89) . Target gene 
expression from the pTrc vector relies on host RNA 
polymerase transcription from a hybrid trp-lac fusion 
promoter. Target gene expression from the pET lid vector 
relies on transcription from a T7 gnlO-lac fusion 
promoter mediated by a coexpressed viral RNA polymerase 
(T7 gnl) . This viral polymerase is supplied by host 
strains BL21(DE3) or HMS174(DE3) from a resident X 
prophage harboring a T7 gnl gene under the 
transcriptional control of the lacUV 5 promoter. 

One strategy to maximize recombinant protein 
expression in E. coli is to express the protein in a host 
bacteria with an impaired capacity to proteolytically 
cleave the recombinant protein (Gottesman, Gene 
Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, California (1990) 119-128) . 
Another strategy is to alter the nucleic acid sequence of 
the nucleic acid to be inserted into an expression vector 
so that the individual codons for each amino acid are 
those preferentially utilized in E. coli (Wada et al. 
(1992) Nucleic Acids Res. 20:2111-2118). Such alteration 
of nucleic acid sequences of the invention can be carried 
out by standard DNA synthesis techniques. 

In another embodiment, the Tango-77 expression 
vector is a yeast expression vector. Examples of vectors 
for expression in yeast S. cerivisae include pYepSecl 
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(Baldari et al . (1987) EMBO J. 6:229-234), pMFa (Kurjan 
and Herskowitz (1982) Cell 30:933-943), pJRY88 (Schultz 
et al. (1987) Gene 54:113-123), pYES2 (Invitrogen 
Corporation, San Diego, CA) , and picZ (InVitrogen Corp, 
5 San Diego, CA) . 

Alternatively, Tango- 77 can be expressed in insect 
cells using baculovirus expression vectors. Baculovirus 
vectors available for expression of proteins in cultured 
insect cells (e.g., Sf 9 cells) include the pAc series 
10 (Smith et al . (1983) Mol. Cell Biol. 3:2156-2165) and the 
pVL series (Lucklow and Summers (1989) Virology 170:31- 
39) . 

In yet another embodiment, a nucleic acid of the 
invention is expressed in mammalian cells using a 

is mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed (1987) Nature 
329:840) and pMT2PC (Kaufman et al . (1987) EMBO J. 6:187- 
195) . When used in mammalian cells, the expression 
vector's control functions are often provided by viral 

20 regulatory elements. For example, commonly used 
promoters are derived from polyoma, Adenovirus 2, 
cytomegalovirus and Simian Virus 40. For other suitable 
expression systems for both prokaryotic and eukaryotic 
cells see chapters 16 and 17 of Sambrook et al . (supra) . 

25 In another embodiment, the recombinant mammalian 

expression vector is capable of directing expression of 
the nucleic acid preferentially in a particular cell type 
(e.g., tissue-specific regulatory elements are used to 
express the nucleic acid) . Tissue-specific regulatory 

30 elements are known in the art. Non-limiting examples of 
suitable tissue-specific promoters include the albumin 
promoter (liver-specific; Pinkert et al. (1987) Genes 
Dev. 1:268-277) , lymphoid-specif ic promoters (Calame and 
Eaton (1988) Adv. Immunol. 43:235-275), in particular 

35 promoters of T cell receptors (Winoto and Baltimore 
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(1989) EMBO J. 8:729-733) and immunoglobulins (Banerji et 
al. (1983) Cell 33:729-740; Queen and Baltimore (1983) 
Cell 33:741-748), neuron-specif ic promoters (e.g., the 
neurofilament promoter; Byrne and Ruddle (1989) Proc. 
5 Natl. Acad. Sci. USA 86:5473-5477), pancreas-specific 

promoters (Edlund et al . (1985) Science 230:912-916), and 
mammary gland-specific promoters (e.g., milk whey 
promoter; U.S. Patent No. 4,873,316 and European 
Application Publication No. 264,166). Development ally - 
10 regulated promoters are also encompassed, for example the 
murine hox promoters (Kessel and Gruss (1990) Science 
249:374-379) and the a-f etoprotein promoter (Campes and 
Tilghman (1989) Genes Dev. 3:537-546). 

The invention further provides a recombinant 
15 expression vector comprising a DNA molecule of the 
invention cloned into the expression vector in an 
antisense orientation. That is, the DNA molecule is 
operably linked to a regulatory sequence in a manner 
which allows for expression (by transcription of the DNA 
20 molecule) of an RNA molecule which is antisense to 

Tango- 77 mRNA. Regulatory sequences operably linked to a 
nucleic acid cloned in the antisense orientation can be 
chosen which direct the continuous expression of the 
antisense RNA molecule in a variety of cell types, for 
25 instance viral promoters and/or enhancers, or regulatory 
sequences can be chosen which direct constitutive, tissue 
specific or cell type specific expression of antisense 
RNA. The antisense expression vector can be in the form 
of a recombinant plasmid, phagemid or attenuated virus in 
30 which antisense nucleic acids are produced under the 
control of a high efficiency regulatory region, the 
activity of which can be determined by the cell type into 
which the vector is introduced. For a discussion of the 
regulation of gene expression using antisense genes see 
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Weintraub et al . (.Reviews - Trends in Genetics, Vol. 1(1) 
1986) . 

Another aspect of the invention pertains to host 
cells into which a recombinant expression vector of the 
5 invention has been introduced . The terms "host cell" and 
"recombinant host cell" are used interchangeably herein. 
It is understood that such terms refer not only to the 
particular subject cell but to the progeny or potential 
progeny of such a cell. Because certain modifications 
10 may occur in succeeding generations due to either 

mutation or environmental influences, such progeny may 
not, in fact, be identical to the parent cell, but are 
still included within the scope of the term as used 
herein. 

is A host cell can be any prokaryotic or eukaryotic 

cell. For example, Tango-77 protein can be expressed in 
bacterial cells such as E. coli, insect cells, yeast or 
mammalian cells (such as Chinese hamster ovary cells 
(CHO) or COS cells) . Other suitable host cells are known 

20 to those skilled in the art. 

Vector DNA can be introduced into prokaryotic or 
eukaryotic cells via conventional transformation or 
transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer 

25 to a variety of art -recognized techniques for introducing 
foreign nucleic acid (e.g., DNA) into a host cell, 
including calcium phosphate or calcium chloride co- 
precipitation, DEAE-dextran-mediated transfection, 
lipofection, or electroporation. Suitable methods for 

30 transforming or transfecting host cells can be found in 
Sambrook, et al . (supra), and other laboratory manuals. 

For stable transfection of mammalian cells, it is 
known that, depending upon the expression vector and 
transfection technique used, only a small fraction of 

35 cells may integrate the foreign DNA into their genome. 
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In order to identify and select these integrants, a gene 
that encodes a selectable marker (e.g., for resistance to 
antibiotics) is generally introduced into the host cells 
along with the gene of interest. Preferred selectable 
5 markers include those which confer resistance to drugs, 
such as G418, hygromycin and methotrexate. Nucleic acid 
encoding a selectable marker can be introduced into a 
host cell on the same vector as that encoding Tango-77 or 
can be introduced on a separate vector. Cells stably 

10 transfected with the introduced nucleic acid can be 
identified by drug selection (e.g., cells that have 
incorporated the selectable marker gene will survive, 
while the other cells die) . 

A host cell of the invention, such as a 

is prokaryotic or eukaryotic host cell in culture, can be 
used to produce (i.e., express) Tango-77 protein. 
Accordingly, the invention further provides methods for 
producing Tango-77 protein using the host cells of the 
invention. In one embodiment, the method comprises 

20 culturing the host cell of invention (into which a 

recombinant expression vector encoding Tango-77 has been 
introduced) in a suitable medium such that Tango-77 
protein is produced. In another embodiment, the method 
further comprises isolating Tango-77 from the medium or 

25 the host cell. 

The host cells of the invention can also be used 
to produce nonhuman transgenic animals. For example, in 
one embodiment, a host cell of the invention is a 
fertilized oocyte or an embryonic stem cell into which 

30 Tango- 77 -coding sequences have been introduced. Such 
host cells can then be used to create non- human 
transgenic animals in which exogenous Tango-77 sequences 
have been introduced into their genome or homologous 
recombinant animals in which endogenous Tango-77 

35 sequences have been altered. Such animals are useful for 
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studying the function and/or activity of Tango- 77 and for 
identifying and/or evaluating modulators of Tango-77 
activity. As used herein, a "transgenic animal" is a 
non-human animal, preferably a mammal, more preferably a 
5 rodent such as a rat or mouse, in which one or more of 
the cells of the animal includes a transgene. Other 
examples of transgenic animals include non-human 
primates, sheep, dogs, cows, goats, chickens, amphibians, 
etc. A transgene is exogenous DNA which is integrated 

10 into the genome of a cell from which a transgenic animal 
develops and which remains in the genome of the mature 
animal, thereby directing the expression of an encoded 
gene product in one or more cell types or tissues of the 
transgenic animal. As used herein, an "homologous 

is recombinant animal" is a non-human animal, preferably a 
mammal, more preferably a mouse, in which an endogenous 
Tango-77 gene has been altered by homologous 
recombination between the endogenous gene and an 
exogenous DNA molecule introduced into a cell of the 

20 animal, e.g., an embryonic cell of the animal, prior to 
development of the animal . 

A transgenic animal of the invention can be 
created by introducing Tango- 77 -encoding nucleic acid 
into the male pronuclei of a fertilized oocyte, e.g., by 

25 microinjection, retroviral infection, and allowing the 
oocyte to develop in a pseudopregnant female foster 
animal. The Tango-77 cDNA sequence e.g., that of (SEQ ID 
NO:l, SEQ ID NO:3, SEQ ID NO:6; SEQ ID NO:10 or the cDNA 
of ATCC 98807) can be introduced as a transgene into the 

30 genome of a non-human animal. Alternatively, a nonhuman 
homologue of the human Tango-77 gene, such as a mouse 
Tango-77 gene, can be isolated based on hybridization to 
the human Tango-77 cDNA and used as a transgene. 
Intronic sequences and polyadenylation signals can also 

35 be included in the transgene to increase the efficiency 
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of expression of the transgene. A tissue-specific 
regulatory sequence (s) can be operably linked to the 
Tango- 77 transgene to direct expression of Tango- 77 
protein to particular cells. Methods for generating 
5 transgenic animals via embryo manipulation and 

microinjection, particularly animals such as mice, have 
become conventional in the art and are described, for 
example, in U.S. Patent Nos. 4,736,866 and 4,870,009, 
U.S. Patent No. 4,873,191 and in Hogan, Manipulating the 

10 Mouse Embryo (Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, N.Y., 1986). Similar methods are used for 
production of other transgenic animals. A transgenic 
founder animal can be identified based upon the presence 
of the Tango- 77 transgene in its genome and/or expression 

is of Tango-77 mRNA in tissues or cells of the animals. A 
transgenic founder animal can then be used to breed 
additional animals carrying the transgene. Moreover, 
transgenic animals carrying a transgene encoding Tango-77 
can further be bred to other transgenic animals carrying 

20 other transgenes. 

To create an homologous recombinant animal, a 
vector is prepared which contains at least a portion of a 
Tango-77 gene (e.g., a human or a non-human homolog of 
the Tango-77 gene, e.g., a murine Tango-77 gene) into 

25 which a deletion, addition or substitution has been 

introduced to thereby alter, e.g., functionally disrupt, 
the Tango-77 gene. In a preferred embodiment, the vector 
is designed such that, upon homologous recombination, the 
endogenous Tango-77 gene is functionally disrupted (i.e., 

30 no longer encodes a functional protein; also referred to 
as a "knock out" vector) . Alternatively, the vector 

can be designed such that, upon homologous recombination, 
the endogenous Tango-77 gene is mutated or otherwise 
altered but still encodes functional protein (e.g., the 

35 upstream regulatory region can be altered to thereby 
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alter the expression of the endogenous Tango- 77 protein) . 
In the homologous recombination vector, the altered 
portion of the Tango- 77 gene is flanked at its 5' and 3' 
ends by additional nucleic acid of the Tango- 77 gene to 
5 allow for homologous recombination to occur between the 
exogenous Tango- 77 gene carried by the vector and an 
endogenous Tango-77 gene in an embryonic stem cell. The 
additional flanking Tango-77 nucleic acid is of 
sufficient length for successful homologous recombination 

10 with the endogenous gene. Typically, several kilobases 
of flanking DNA (both at the 5' and 3' ends) are included 
in the vector {see, e.g., Thomas and Capecchi (1987) Cell 
51:503 for a description of homologous recombination 
vectors) . The vector is introduced into an embryonic 

is stem cell line (e.g., by electroporation) and cells in 
which the introduced Tango-77 gene has homologously 
recombined with the endogenous Tango-77 gene are selected 
{see, e.g., Li et al . (1992) Cell 69:915). The selected 
cells are then injected into a blastocyst of an animal 

20 (e.g., a mouse) to form aggregation chimeras {see, e.g., 
Bradley in Teratocarcinomas and Embryonic Stem Cells: A 
Practical Approach, Robertson, ed. (IRL, Oxford, 1987) 
pp. 113-152) . A chimeric embryo can then be implanted 
into a suitable pseudopregnant female foster animal and 

25 the embryo brought to term. Progeny harboring the 

homologously recombined DNA in their germ cells can be 
used to breed animals in which all cells of the animal 
contain the homologously recombined DNA by germline 
transmission of the transgene. Methods for constructing 

30 homologous recombination vectors and homologous 

recombinant animals are described further in Bradley 
(1991) Current Opinion in Bio/Technology 2:823-829 and in 
PCT Publication Nos. WO 90/11354, WO 91/01140, WO 
92/0968, and WO 93/04169. 
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In another embodiment, transgenic non-human 
animals can be produced which contain selected systems 
which allow for regulated expression of the transgene. 
One example of such a system is the cre/loxP recombinase 
system of bacteriophage PI . For a description of the 
cre/loxP recombinase system, see, e.g., Lakso et al . 
(1992) Proc. Natl. Acad. Sci. USA 89:6232-6236. Another 
example of a recombinase system is the FLP recombinase 
system of Saccharomyces cerevisiae (O' Gorman et al. 
(1991) Science 251:1351-1355. If a cre/loxP recombinase 
system is used to regulate expression of the transgene, 
animals containing transgenes encoding both the Cre 
recombinase and a selected protein are required. Such 
animals can be provided through the construction of 
"double" transgenic animals, e.g., by mating two 
transgenic animals, one containing a transgene encoding a 
selected protein and the other containing a transgene 
encoding a recombinase. 

Clones of the non-human transgenic animals 
described herein can also be produced according to the 
methods described in Wilmut et al . (1997) Nature 385:810- 
813 and PCT Publication Nos. WO 97/07668 and WO 97/07669. 
In brief, a cell, e.g., a somatic cell, from the 
transgenic animal can be isolated and induced to exit the 
growth cycle and enter G 0 phase. The quiescent cell can 
then be fused, e.g., through the use of electrical 
pulses, to an enucleated oocyte from an animal of the 
same species from which the quiescent cell is isolated. 
The reconstructed oocyte is then cultured such that it 
develops to morula or blastocyte and then transferred to 
pseudopregnant female foster animal. The offspring borne 
of this female foster animal will be a clone of the 
animal from which the cell, e.g., the somatic cell, is 
isolated. 
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IV. Pharmaceutical Compositions 

The Tango- 77 nucleic acid molecules, Tango- 77 
proteins, and anti-Tango-77 antibodies (also referred to 
herein as "active compounds") of the invention can be 
incorporated into pharmaceutical compositions suitable 
for administration. Such compositions typically comprise 
the nucleic acid molecule, protein, or antibody and a 
pharmaceutically acceptable carrier. As used herein the 
language "pharmaceutically acceptable carrier" is 
intended to include any and all solvents, dispersion 
media, coatings, antibacterial and antifungal agents, 
isotonic and absorption delaying agents, and the like, 
compatible with pharmaceutical administration. The use 
of such media and agents for pharmaceutically active 
substances is. well known in the art. Except insofar as 
any conventional media or agent is incompatible with the 
active compound, use thereof in the compositions is 
contemplated. Supplementary active compounds can also be 
incorporated into the compositions. 

A pharmaceutical composition of the invention is 
formulated to be compatible with its intended route of 
administration. Examples of routes of administration 
include parenteral, (e.g. intravenous, intradermal, 
subcutaneous) (e.g., oral inhalation), transdermal 
(topical), transmucosal, and rectal administration. 
Solutions or suspensions used for parenteral, 
intradermal, or subcutaneous application can include the 
following components: a sterile diluent such as water for 
injection, saline solution, fixed oils, polyethylene 
glycols, glycerine, propylene glycol or other synthetic 
solvents; antibacterial agents such as benzyl alcohol or 
methyl parabens; antioxidants such as ascorbic acid or 
sodium bisulfite; chelating agents such as 
ethylenediaminetetraacetic acid; buffers such as 
acetates, citrates or phosphates and agents for the 



WO 99/06426 



PCT/US98/16102 



- 56 - 

adjustment of tonicity such as sodium chloride or 
dextrose. pH can be adjusted with acids or bases, such 
as hydrochloric acid or sodium hydroxide. The parenteral 
preparation can be enclosed in ampoules, disposable 
5 syringes or multiple dose vials made of glass or plastic. 

Pharmaceutical compositions suitable for 
injectable use include sterile aqueous solutions (where 
water soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable 

10 solutions or dispersions. For intravenous 

administration, suitable carriers include physiological 
saline, bacteriostatic water, Cremophor EL™ (BASF; 
Parsippany, NJ) or phosphate buffered saline (PBS) . In 
all cases, the composition must be sterile and should be 

15 fluid to the extent that easy syringability exists. It 
must be stable under the conditions of manufacture and 
storage and must be preserved against the contaminating 
action of microorganisms such as bacteria and fungi. The 
carrier can be a solvent or dispersion medium containing, 

20 for example, water, ethanol, polyol (for example, 
glycerol, propylene glycol, and liquid polyetheylene 
glycol, and the like), and suitable mixtures thereof. 
The proper fluidity can be maintained, for example, by 
the use of a coating such as lecithin, by the maintenance 

25 of the required particle size in the case of dispersion 
and by the use of surfactants. Prevention of the action 
of microorganisms can be achieved by various 
antibacterial and antifungal agents, for example, 
parabens, chlorobutanol , phenol, ascorbic acid, 

30 thimerosal, and the like. In many cases, it will be 
preferable to include isotonic agents, for example, 
sugars, polyalcohols such as mannitol, sorbitol, sodium 
chloride in the composition. Prolonged absorption of the 
injectable compositions can be brought about by including 
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in the composition an agent which delays absorption, for 
example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by 
incorporating the active compound (e.g., a Tango-77 
5 protein or anti -Tango- 77 antibody) in the required amount 
in an appropriate solvent with one or a combination of 
ingredients enumerated above, as required, followed by 
filtered sterilization. Generally, dispersions are 
prepared by incorporating the active compound into a 

10 sterile vehicle which contains a basic dispersion medium 
and the required other ingredients from those enumerated 
above. In the case of sterile powders for the 
preparation of sterile injectable solutions, the 
preferred methods of preparation are vacuum drying and 

15 f reeze-drying which yields a powder of the active 

ingredient plus any additional desired ingredient from a 
previously sterile-filtered solution thereof. 

Oral compositions generally include an inert 
diluent or an edible carrier. They can be enclosed in 

20 gelatin capsules or compressed into tablets. For the 
purpose of oral therapeutic administration, the active 
compound can be incorporated with excipients and used in 
the form of tablets, troches, or capsules. Oral 
compositions can also be prepared using a fluid carrier 

25 for use as a mouthwash, wherein the compound in the fluid 
carrier is applied orally and swished and expectorated or 
swallowed. Pharmaceutically compatible binding agents, 
and/or adjuvant materials can be included as part of the 
composition. The tablets, pills, capsules, troches and 

30 the like can contain any of the following ingredients, or 
compounds of a similar nature: a binder such as 
microcrystalline cellulose, gum tragacanth or gelatin; an 
excipient such as starch or lactose, a disintegrating 
agent such as alginic acid, Primogel, or corn starch; a 

35 lubricant such as magnesium stearate or Sterotes; a 
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glidant such as colloidal silicon dioxide; a sweetening 
agent such as sucrose or saccharin; or a flavoring agent 
such as peppermint, methyl salicylate, or orange 
flavoring. 

5 For administration by inhalation, the compounds 

are delivered in the form of an aerosol spray from a 
pressurized container or dispenser which contains a 
suitable propellant, e.g., a gas such as carbon dioxide, 
or a nebulizer. 

io Systemic administration can also be by 

transmucosal or transdermal means. For transmucosal or 
transdermal administration, penetrants appropriate to the 
barrier to be permeated are used in the formulation. 
Such penetrants are generally known in the art, and 

15 include, for example, for transmucosal administration, 
detergents, bile salts, and fusidic acid derivatives. 
Transmucosal administration can be accomplished through 
the use of nasal sprays or suppositories. For 
transdermal administration, the active compounds are 

20 formulated into ointments, salves, gels, or creams as 
generally known in the art. 

The compounds can also be prepared in the form of 
suppositories (e.g., with conventional suppository bases 
such as cocoa butter and other glycerides) or retention 

25 enemas for rectal delivery. 

In one embodiment, the active compounds are 
prepared with carriers that will protect the compound 
against rapid elimination from the body, such as a 
controlled release formulation, including implants and 

30 microencapsulated delivery systems. Biodegradable, 
biocompatible polymers can be used, such as ethylene 
vinyl acetate, polyanhydrides, polyglycolic acid, 
collagen, polyorthoesters, and polylactic acid. Methods 
for preparation of such formulations will be apparent to 

35 those skilled in the art. The materials can also be 
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obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including 
liposomes targeted to infected cells with monoclonal 
antibodies to viral antigens) can also be used as 
5 pharmaceutically acceptable carriers. These can be 
prepared according to methods known to those skilled in 
the art, for example, as described in U.S. Patent No. 
4,522,811. 

It is especially advantageous to formulate oral or 

10 parenteral compositions in dosage unit form for ease of 
administration and uniformity of dosage. Dosage unit 
form as used herein refers to physically discrete units 
suited as unitary dosages for the subject to be treated; 
each unit containing a predetermined quantity of active 

15 compound calculated to produce the desired therapeutic 
effect in association with the required pharmaceutical 
carrier. The specification for the dosage unit forms of 
the invention are dictated by and directly dependent on 
the unique characteristics of the active compound and the 

20 particular therapeutic effect to be achieved, and the 
limitations inherent in the art of compounding such an 
active compound for the treatment of individuals . 

The nucleic acid molecules of the invention can be 
inserted into vectors and used as gene therapy vectors . 

25 Gene therapy vectors can be delivered to a subject by, 
for example, intravenous injection, local administration 
(U.S. Patent 5,328,470) or by stereotactic injection 
(see, e.g., Chen et al . (1994) Proc. Natl. Acad. Sci. USA 
91:3054-3057). The pharmaceutical preparation of the 

30 gene therapy vector can include the gene therapy vector 
in an acceptable diluent, or can comprise a slow release 
matrix in which the gene delivery vehicle is imbedded. 
Alternatively, where the complete gene delivery vector 
can be produced intact from recombinant cells, e.g. 

35 retroviral vectors, the pharmaceutical preparation can 
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include one or more cells which produce the gene delivery- 
system. 

The pharmaceutical compositions can be included in 
a container, pack, or dispenser together with 
5 instructions for administration. 

V. Uses and Methods of the Invention 

The nucleic acid molecules, proteins, protein 
homologues, and antibodies described herein can be used 
in one or more of the following methods: a) screening 

10 assays; b) detection assays (e.g., chromosomal mapping, 
tissue typing, forensic biology) ; c) predictive medicine 
(e.g., diagnostic assays, prognostic assays, monitoring 
clinical trials, and pharmacogenomics) ; and d) methods of 
treatment (e.g., therapeutic and prophylactic) . A 

is Tango- 77 protein interacts with other cellular proteins 
and can thus be used for regulation of inflammation. The 
polypeptides of the invention can be used in assays to 
determine biological activity. For example, they could 
be used in a panel of proteins for high- throughput 

20 screening. 

The isolated nucleic acid molecules of the 
invention can be used to express Tango-77 protein (e.g., 
via a recombinant expression vector in a host cell in 
gene therapy applications) , to detect Tango-77 mRNA 

25 (e.g., in a biological sample) or a genetic lesion in a 
Tango-77 gene, and to modulate Tango-77 activity. In 
addition, the Tango-77 proteins can be used to screen 
drugs or compounds which modulate the Tango-77 activity 
or expression as well as to treat disorders characterized 

30 by insufficient or excessive production of Tango-77 
protein or production of Tango-77 protein forms which 
have decreased or aberrant activity compared to Tango-77 
wild type protein. In addition, the anti-Tango-77 
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antibodies of the invention can be used to detect and 
isolate Tango-77 proteins and modulate Tango-77 activity. 

This invention further pertains to novel agents 
identified by the above -described screening assays and 
5 uses thereof for treatments as described herein. 

A. Screening Assays 

The invention provides a method (also referred to 

herein as a "screening assay") for identifying 

modulators, i.e., candidate or test compounds or agents 
10 (e.g., peptides, peptidomimetics , small molecules or 

other drugs) which bind to Tango-77 proteins or have a 

stimulatory or inhibitory effect on, for example, 

Tango-77 expression or Tango-77 activity. 

Examples of methods for the synthesis of molecular 
is libraries can be found in the art, for example in: 

DeWitt et al . (1993) Proc. Natl. Acad. Sci . USA 90:6909; 

Erb et al . (1994) Proc. Natl. Acad. Sci. USA 91:11422; 

Zuckermann et al . (1994). J. Med. Chem. 37:2678; Cho et 

al. (1993) Science 261:1303; Carrell et al . (1994) Angew. 
20 Chem. Int. Ed. Engl. 33:2059; Carell et al . (1994) Angew. 

Chem. Int. Ed. Engl. 33:2061; and Gallop et al . (1994) J". 

Med. Chem. 37:1233. 

Libraries of compounds may be presented in 

solution (e.g., Houghten (1992) Bio /Techniques 13:412- 
25 421), or on beads (Lam (1991) -Nature 354:82-84), chips 
(Fodor (1993) Nature 364 : 555-556) , bacteria (U.S. Patent 

No. 5,223,409), spores (Patent Nos. 5,571,698; 5,403,484; 

and 5,223,409), plasmids (Cull et al . (1992) Proc. Natl. 

Acad. Sci. USA 89:1865-1869) or phage (Scott and Smith 
30 (1990) Science 249:386-390; Devlin (1990) Science 

249:404-406; Cwirla et al . (1990) Proc. Natl. Acad. Sci. 

USA 87:6378-6382; and Felici (1991) J. Mol. Biol. 

222 :301-310) . 
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In another embodiment, an assay is used to 
determine the ability of the test compound to modulate 
the activity of Tango-77 or a biologically active portion 
thereof, for example, by determining the ability of the 
5 Tango-77 protein to bind to or interact with a Tango-77 
target molecule. As used herein, a "target molecule" is 
a molecule with which a Tango-77 protein binds or 
interacts in nature, for example, a molecule on the 
surface of a cell, A Tango-77 target molecule can be a 

10 non-Tango- 77 molecule or a Tango-77 protein or 

polypeptide of the present invention. In one embodiment, 
a Tango-77 target molecule is a component of a signal 
transduction pathway, for example, Tango-77 may bind to a 
IL-1 receptor or another receptor thereby blocking the 

15 receptor and inhibiting future signal transduction. 

Determining the ability of the Tango-77 protein to bind 
to or interact with a Tango-77 target molecule can be 
accomplished by one of the methods described above. In a 
preferred embodiment, determining the ability of the 

20 Tango-77 protein to bind to or interact with a Tango-77 
target molecule can be accomplished by determining the 
activity of the target molecule. For example, the 
activity of the target molecule can be determined by 
detecting induction of a cellular second messenger of the 

25 target (e.g., intracellular Ca 2+ , diacylglycerol , IP3, 
etc.), detecting catalytic/enzymatic activity of the 
target on an appropriate substrate, detecting the 
induction of a reporter gene (e.g., a Tango -77 -responsive 
regulatory element operably linked to a nucleic acid 

30 encoding a detectable marker, e.g. luciferase) , or 

detecting a cellular response, for example, inflammation. 

In yet another embodiment, an assay of the present 
invention is a cell -free assay comprising contacting a 
Tango-77 protein or biologically active portion thereof 

35 with a test compound and determining the ability of the 
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test compound to bind to the Tango-77 protein or 
biologically active portion thereof. Binding of the test 
compound to the Tango-77 protein can be determined either 
directly or indirectly as described above. In a 
5 preferred embodiment, the assay includes contacting the 
Tango-77 protein or biologically active portion thereof 
with a known compound which binds Tango-77 to form an 
assay mixture, contacting the assay mixture with a test 
compound, and determining the ability of the test 

10 compound to interact with a Tango-77 protein, wherein 
determining the ability of the test compound to interact 
with a Tango-77 protein comprises determining the ability 
of the test compound to preferentially bind to Tango-77 
or biologically active portion thereof as compared to the 

15 known compound. 

In another embodiment, an assay is a cell -free 
assay comprising contacting Tango-77 protein or 
biologically active portion thereof with a test compound 
and determining the ability of the test compound to 

20 modulate (e.g., stimulate or inhibit) the activity of the 
Tango-77 protein or biologically active portion thereof. 
Determining the ability of the test compound to modulate 
the activity of Tango-77 can be accomplished, for 
example, by determining the ability of the Tango-77 

25 protein to bind to a Tango-77 target molecule by one of 
the methods described above for determining direct 
binding. In an alternative embodiment, determining the 
ability of the test compound to modulate the activity of 
Tango-77 can be accomplished by determining the ability 

30 of the Tango-77 protein to further modulate a Tango-77 
target molecule. For example, the catalytic/enzymatic 
activity of the target molecule on an appropriate 
substrate can be determined as previously described. 

In yet another embodiment, the cell -free assay 

35 comprises contacting the Tango-77 protein or biologically 
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active portion thereof with a known compound which binds 
Tango-77 to form an assay mixture, contacting the assay 
mixture with a test compound, and determining the ability 
of the test compound to interact with a Tango-77 protein, 
5 wherein determining the ability of the test compound to 
interact with a Tango-77 protein comprises determining 
the ability of the Tango-77 protein to preferentially 
bind to or modulate the activity of a Tango-77 target 
molecule . 

io It is possible that membrane -bound forms of Tango- 

77 exist. The cell-free assays of the present invention 
are amenable to use of both the forms Tango-77. In the 
case of cell-free assays comprising a membrane -bound form 
of Tango-77, it may be desirable to utilize a 

15 solubilizing agent such that the membrane -bound form of 
Tango-77 is maintained in solution. Examples of such 
solubilizing agents include non-ionic detergents such as 
n-octylglucoside , n-dodecylglucoside , n-dodecylmaltoside , 
octanoyl -N-methylglucamide , decanoyl -N-methylglucamide , 

20 Triton® X-100, Triton® X-114, Thesit®, 

Isotridecypoly (ethylene glycol ether)n, 3- [(3- 
cholamidopropyl) dimethylamminio] -1 -propane sulfonate 
(CHAPS) , 3- [ (3-cholamidopropyl) dimethylamminio] -2- 
hydroxy-1 -propane sulfonate (CHAPSO) , or N-dodecyl=N,N- 

25 dimethyl-3-ammonio-l-propane sulfonate. 

In more than one embodiment of the above assay 
methods of the present invention, it may be desirable to 
immobilize either Tango-77 or its target molecule to 
facilitate separation of complexed from uncomplexed forms 

30 of one or both of the proteins, as well as to accommodate 
automation of the assay. Binding of a test compound to 
Tango-77, or interaction of Tango-77 with a target 
molecule in the presence and absence of a candidate 
compound, can be accomplished in any vessel suitable for 

35 containing the reactants. Examples of such vessels 
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include microtitre plates, test tubes, and micro- 
centrifuge tubes. In one embodiment, a fusion protein 
can be provided which adds a domain that allows one or 
both of the proteins to be bound to a matrix. For 
example, glutathione-S-transferase/ Tango-77 fusion 
proteins or glutathione-S-transf erase/target fusion 
proteins can be adsorbed onto glutathione sepharose beads 
(Sigma Chemical Co.; St. Louis, MO) or glutathione 
derivatized microtitre plates, which are then combined 
with the test compound or the test compound and either 
the non-adsorbed target protein or Tango-77 protein, and 
the mixture incubated under conditions conducive to 
complex formation (e.g., at physiological conditions for 
salt and pH) . Following incubation, the beads or 
microtitre plate wells are washed to remove any unbound 
components and complex formation is measured either 
directly or indirectly, for example, as described above. 
Alternatively, the complexes can be dissociated from the 
matrix, and the level of Tango-77 binding or activity 
determined using standard techniques. 

Other techniques for immobilizing proteins on 
matrices can also be used in the screening assays of the 
invention. For example, either Tango-77 or its target 
molecule can be immobilized utilizing conjugation of 
biotin and streptavidin. Biotinylated Tango-77 or target 
molecules can be prepared from biotin-NHS (N-hydroxy- 
succinimide) using techniques well known in the art 
(e.g., biotinylation kit, Pierce Chemicals; Rockford, 
IL) , and immobilized in the wells of streptavidin- coated 
96 well plates (Pierce Chemical) . Alternatively, 
antibodies reactive with Tango-77 or target molecules but 
which do not interfere with binding of the Tango-77 
protein to its target molecule can be derivatized to the 
wells of the plate, and unbound target or Tango-77 
trapped in the wells by antibody conjugation. Methods 
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for detecting such complexes, in addition to those 
described above for the GST -immobilized complexes, 
include immunodetection of complexes using antibodies 
reactive with the Tango-77 or target molecule, as well as 
5 enzyme-linked assays which rely on detecting an enzymatic 
activity associated with the Tango-77 or target molecule. 

In another embodiment, modulators of Tango-77 
expression are identified in a method in which a cell is 
contacted with a candidate compound and the expression of 

10 Tango-77 mRNA or protein in the cell is determined. The 
level of expression of Tango-77 mRNA or protein in the 
presence of the candidate compound is compared to the 
level of expression of Tango-77 mRNA or protein in the 
absence of the candidate compound. The candidate 

15 compound can then be identified as a modulator of 
Tango-77 expression based on this comparison. For 
example, when expression of Tango-77 mRNA or protein is 
greater (statistically significantly greater) in the 
presence of the candidate compound than in its absence, 

20 the candidate compound is identified as a stimulator of 
Tango-77 mRNA or protein expression. Alternatively, when 
expression of Tango-77 mRNA or protein is less 
(statistically significantly less) in the presence of the 
candidate compound than in its absence, the candidate 

25 compound is identified as an inhibitor of Tango-77 mRNA 
or protein expression. The level of Tango-77 mRNA or 
protein expression in the cells can be determined by 
methods described herein for detecting Tango-77 mRNA or 
protein. 

30 In yet another aspect of the invention, the 

Tango-77 proteins can be used as "bait proteins" in a 
two-hybrid assay or three hybrid assay (see, e.g., U.S. 
Patent No. 5,283,317; Zervos et al . (1993) Cell 72:223- 
232; Madura et al . (1993) J". Biol. Chem. 268:12046-12054; 

35 Bartel et al . (1993) Bio/Techniques 14:920-924; Iwabuchi 
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et al. (1993) Oncogene 8:1693-1696; and PCT Publication 
No. WO 94/10300), to identify other proteins, which bind 
to or interact with Tango-77 ( "Tango- 77 -binding proteins" 
or "Tango-77-bp") and modulate Tango-77 activity. Such 
5 Tango- 77 -binding proteins are also likely to be involved 
in the propagation of signals by the Tango-77 proteins 
as, for example, upstream or downstream elements of the 
Tango-77 pathway. 

The two-hybrid system is based on the modular 

10 nature of most transcription factors, which consist of 
separable DNA-binding and activation domains. Briefly, 
the assay utilizes two different DNA constructs. In one 
construct, the gene that codes for Tango-77 is fused to a 
gene encoding the DNA binding domain of a known 

is transcription factor (e.g., GAL-4) . In the other 
construct, a DNA sequence, from a library of DNA 
sequences, that encodes an unidentified protein ("prey" 
or "sample") is fused to a gene that codes for the 
activation domain of the known transcription factor. If 

20 the "bait" and the "prey" proteins are able to interact, 
in vivo, forming an Tango- 77 -dependent complex, the DNA- 
binding and activation domains of the transcription 
factor are brought into close proximity. This proximity 
allows transcription of a reporter gene (e.g., LacZ) 

25 which is operably linked to a transcriptional regulatory 
site responsive to the transcription factor. Expression 
of the reporter gene can be detected and cell colonies 
containing the functional transcription factor can be 
isolated and used to obtain the cloned gene which encodes 

30 the protein which interacts with Tango-77. 

This invention further pertains to novel agents 
identified by the above -described screening assays and 
uses thereof for treatments as described herein. 
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B. Detection Assays 

Portions or fragments of the cDNA sequence 
identified herein (and the corresponding complete gene 
sequences) can be used in numerous ways as polynucleotide 
reagents. For example, the sequence can be used to: (i) 
map the respective gene on a chromosome and, thus, locate 
gene regions associated with genetic disease; (ii) 
identify an individual from a minute biological sample 
(tissue typing) ; and (iii) aid in forensic identification 
of a biological sample. These applications are described 
in the subsections below. 

1 . Chromosome Mapping 

Once the sequence (or a portion of the sequence) 
of a gene has been isolated, this sequence can be used to 
map the location of the gene on a chromosome. 
Accordingly, Tango-77 nucleic acid molecules described 
herein or fragments thereof, can be used to map the 
location of the Tango-77 gene(s) on a chromosome. The 
mapping of the Tango-77 sequences to chromosomes is an 
important first step in correlating these sequences with 
genes associated with disease. 

Briefly, a Tango-77 gene can be mapped to 
chromosomes by preparing PGR primers (preferably 15-25 bp 
in length) from the Tango-77 sequences. Computer 
analysis of Tango-77 sequences can be used to rapidly 
select primers that do not span more than one exon in the 
genomic DNA, thus complicating the amplification process. 
These primers can then be used for PGR screening of 
somatic cell hybrids containing individual human 
chromosomes . Only those hybrids containing the human 
gene corresponding to the Tango-77 sequences will yield 
an amplified fragment. 

Somatic cell hybrids are prepared by fusing 
somatic cells from different mammals (e.g., human and 



WO 99/06426 



PCT/US98/16102 



- 69 - 

mouse cells) . As hybrids of human and mouse cells grow 
and divide, they gradually lose human chromosomes in 
random order, but retain the mouse chromosomes. By using 
media in which mouse cells cannot grow (because they lack 
5 a particular enzyme) but in which human cells can, the 
one human chromosome that contains the gene encoding the 
needed enzyme, will be retained. By using various media, 
panels of hybrid cell lines can be established. Each 
cell line in a panel contains either a single human 

10 chromosome or a small number of human chromosomes, and a 
full set of mouse chromosomes, allowing easy mapping of 
individual genes to specific human chromosomes . 
(D'Eustachio et al . (1983) Science 220:919-924). Somatic 
cell hybrids containing only fragments of human 

15 chromosomes can also be produced by using human 
chromosomes with translocations and deletions. 

PCR mapping of somatic cell hybrids is a rapid 
procedure for assigning a particular sequence to a 
particular chromosome. Three or more sequences can be 

20 assigned per day using a single thermal cycler. Using 
the Tango- 77 sequences to design oligonucleotide primers, 
sublocalization can be achieved with panels of fragments 
from specific chromosomes. Other mapping strategies 
which can similarly be used to map a Tango- 77 sequence to 

25 its chromosome include in situ hybridization (described 
in Fan et al . (1990) Proc. Natl. Acad. Sci . USA 87:6223- 
27) , pre-screening with labeled flow- sorted chromosomes, 
and pre-selection by hybridization to chromosome specific 
cDNA libraries. 

30 Fluorescence in situ hybridization (FISH) of a DNA 

sequence to a metaphase chromosomal spread can further be 
used to provide a precise chromosomal location in one 
step. Chromosome spreads can be made using cells whose 
division has been blocked in metaphase by a chemical, 

35 e.g., colcemid that disrupts the mitotic spindle. The 
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chromosomes can be treated briefly with trypsin, and then 
stained with Giemsa. A pattern of light and dark bands 
develops on each chromosome, so that the chromosomes can 
be identified individually. The FISH technique can be 
5 used with a DNA sequence as short as 500 or 600 bases. 
However, clones larger than 1,000 bases have a higher 
likelihood of binding to a unique chromosomal location 
with sufficient signal intensity for simple detection. 
Preferably 1,000 bases, and more preferably 2,000 bases 

10 will suffice to get good results at a reasonable amount 
of time. For a review of this technique, see Verma et 
al . (Human Chromosomes: A Manual of Basic Techniques 
(Pergamon Press, New York, 1988)). 

Reagents for chromosome mapping can be used 

is individually to mark a single chromosome or a single site 
on that chromosome, or panels of reagents can be used for 
marking multiple sites and/or multiple chromosomes. 
Reagents corresponding to noncoding regions of the genes 
actually are preferred for mapping purposes. Coding 

20 sequences are more likely to be conserved within gene 
families, thus increasing the chance of cross 
hybridizations during chromosomal mapping. 

Once a sequence has been mapped to a precise 
chromosomal location, the physical position of the 

25 sequence on the chromosome can be correlated with genetic 
map data. (Such data are found, for example, in V. 
McKusick, Mendelian Inheritance in Man, available on-line 
through Johns Hopkins University Welch Medical Library) . 
The relationship between genes and disease, mapped to the 

30 same chromosomal region, can then be identified through 
linkage analysis (co- inheritance of physically adjacent 
genes), described in, e.g., Egeland et al . (1987) JVature 
325:783-787. 

Moreover, differences in the DNA sequences between 
35 individuals affected and unaffected with a disease 
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associated with the Tango-77 gene can be determined. If 
a mutation is observed in some or all of the affected 
individuals but not in any unaffected individuals, then 
the mutation is likely to be the causative agent of the 
particular disease. Comparison of affected and 
unaffected individuals generally involves first looking 
for structural alterations in the chromosomes such as 
deletions or translocations that are visible from 
chromosome spreads or detectable using PCR based on that 
DNA sequence. Ultimately, complete sequencing of genes 
from several individuals can be performed to confirm the 
presence of a mutation and to distinguish mutations from 
polymorphisms . 

2. Tissue Typing 

The Tango-77 sequences of the present invention 
can also be used to identify individuals from minute 
biological samples. The United States military, for 
example, is considering the use of restriction fragment 
length polymorphism (RFLP) for identification of its 
personnel. In this technique, an individual's genomic 
DNA is digested with one or more restriction enzymes, and 
probed on a Southern blot to yield unique bands for 
identification. This method does not suffer from the 
current limitations of "Dog Tags" which can be lost, 
switched, or stolen, making positive identification 
difficult. The sequences of the present invention are 
useful as additional DNA markers for RFLP (described in 
U.S. Patent 5,272,057). 

Furthermore, the sequences of the present 
invention can be used to provide an alternative technique 
which determines the actual base-by-base DNA sequence of 
selected portions of an individual's genome. Thus, the 
Tango-77 sequences described herein can be used to 
prepare two PCR primers from the 5' and 3' ends of the 
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sequences. These primers can then be used to amplify an 
individual's DNA and subsequently sequence it. 

Panels of corresponding DNA sequences from 
individuals, prepared in this manner, can provide unique 
5 individual identifications, as each individual will have 
a unique set of such DNA sequences due to allelic 
differences. The sequences of the present invention can 
be used to obtain such identification sequences from 
individuals and from tissue. The Tango-77 sequences of 
10 the invention uniquely represent portions of the human 
genome. Allelic variation occurs to some degree in the 
coding regions of these sequences, and to a greater 
degree in the noncoding regions. It is estimated that 
allelic variation between individual humans occurs with a 
is frequency of about once per each 500 bases. Each of the 
sequences described herein can, to some degree, be used 
as a standard against which DNA from an individual can be 
compared for identification purposes. Because greater 
numbers of polymorphisms occur in the noncoding regions, 
20 fewer sequences are necessary to differentiate 

individuals. The noncoding sequences of SEQ ID NO:l can 
comfortably provide positive individual identification 
with a panel of perhaps 10 to 1,000 primers which each 
yield a noncoding amplified sequence of 100 bases. If 
25 predicted coding sequences, such as those in SEQ ID NO: 3, 
SEQ ID NO: 6, or SEQ ID NO: 10 are used, a more appropriate 
number of primers for positive individual identification 
would be 500-2, 000. 

If a panel of reagents from Tango-77 sequences 
30 described herein is used to generate a unique 

identification database for an individual, those same 
reagents can later be used to identify tissue from that 
individual. Using the unique identification database, 
positive identification of the individual, living or 
35 dead, can be made from extremely small tissue samples. 
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3. Use of Partial Tango-77 Sequences in Forensic 

Biology 

DNA-based identification techniques can also be 
used in forensic biology. Forensic biology is a 
s scientific field employing genetic typing of biological 
evidence found at a crime scene as a means for positively 
identifying, for example, a perpetrator of a crime. To 
make such an identification, PCR technology can be used 
to amplify DNA sequences taken from very small biological 

10 samples such as tissues, e.g., hair or skin, or body 
fluids, e.g., blood, saliva, or semen found at a crime 
scene. The amplified sequence can then be compared to a 
standard, thereby allowing identification of the origin 
of the biological sample. 

is The sequences of the present invention can be used 

to provide polynucleotide reagents, e.g., PCR primers, 
targeted to specific loci in the human genome, which can 
enhance the reliability of DNA-based forensic 
identifications by, for example, providing another 

20 "identification marker" (i.e. another DNA sequence that 
is unique to a particular individual) . As mentioned 
above, actual base sequence information can be used for 
identification as an accurate alternative to patterns 
formed by restriction enzyme generated fragments. 

25 Sequences targeted to noncoding regions of SEQ ID NO:l 
are particularly appropriate for this use as greater 
numbers of polymorphisms occur in the noncoding regions, 
making it easier to differentiate individuals using this 
technique. Examples of polynucleotide reagents include 

30 the Tango-77 sequences or portions thereof, e.g., 

fragments derived from the noncoding regions of SEQ ID 
N0:1 having a length of at least 20 or 30 bases. 

The Tango- 77 sequences described herein can 
further be used to provide polynucleotide reagents, e.g., 

35 labeled or labelable probes which can be used in, for 
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example, an in situ hybridization technique, to identify 
a specific tissue, e.g., brain tissue. This can be very 
useful in cases where a forensic pathologist is presented 
with a tissue of unknown origin. Panels of such Tango- 77 

5 probes can be used to identify tissue by species and/or 
by organ type. 

In a similar fashion, these reagents, e.g., 
Tango-77 primers or probes can be used to screen tissue 
culture for contamination (i.e., screen for the presence 

10 of a mixture of different types of cells in a culture) . 

C . Predictive Medicine 

The present invention also pertains to the field 
of predictive medicine in which diagnostic assays, 
prognostic assays, pharmacogenomics, and monitoring 
is clinical trails are used for prognostic (predictive) 

purposes to thereby treat an individual prophylactically . 
Accordingly, one aspect of the present invention relates 
to diagnostic assays for determining Tango-77 protein 
and/or nucleic acid expression as well as Tango-77 
20 activity, in the context of a biological sample (e.g., 

blood, serum, cells, tissue) to thereby determine whether 
an individual is afflicted with a disease or disorder, or 
is at risk of developing a disorder, associated with 
aberrant Tango-77 expression or activity. The invention 
25 also provides for prognostic (or predictive) assays for 
determining whether an individual is at risk of 
developing a disorder associated with Tango-77 protein, 
nucleic acid expression or activity. For example, 
mutations in a Tango-77 gene can be assayed in a 
30 biological sample. Such assays can be used for 
prognostic or predictive purpose to thereby 
prophylactically treat an individual prior to the onset 
of a disorder characterized by or associated with 
Tango-77 protein, nucleic acid expression or activity. 
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Another aspect of the invention provides methods 
for determining Tango- 77 protein, nucleic acid expression 
or Tango- 77 activity in an individual to thereby select 
appropriate therapeutic or prophylactic agents for that 

i individual (referred to herein as "pharmacogenomics" ) . 
Pharmacogenomics allows for the selection of agents 
(e.g., drugs) for therapeutic or prophylactic treatment 
of an individual based on the genotype of the individual 
(e.g., the genotype of the individual examined to 

} determine the ability of the individual to respond to a 
particular agent.) 

Yet another aspect of the invention pertains to 
monitoring the influence of agents (e.g., drugs or other 
compounds) on the expression or activity of Tango- 77 in 

5 clinical trials. 

These and other agents are described in further 
detail in the following sections. 

1 . Diagnostic Assays 

An exemplary method for detecting the presence or 
o absence of Tango- 77 in a biological sample involves 
obtaining a biological sample from a test subject and 
contacting the biological sample with a compound or an 
agent capable of detecting Tango- 77 protein or nucleic 
acid (e.g., mRNA, genomic DNA) that encodes Tango-77 
5 protein such that the presence of Tango-77 is detected in 
the biological sample. A preferred agent for detecting 
Tango-77 mRNA or genomic DNA is a labeled nucleic acid 
probe capable of hybridizing to Tango-77 mRNA or genomic 
DNA. The nucleic acid probe can be, for example, a full- 
o length Tango-77 nucleic acid, such as the nucleic acid of 
SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 6, SEQ ID NO: 10 or a 
portion thereof, such as an oligonucleotide of at least 
15, 30, 50, 100, 250 or 500 nucleotides in length and 
sufficient to specifically hybridize under stringent 
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conditions to Tango- 77 mRNA or genomic DNA, Other 
suitable probes for use in the diagnostic assays of the 
invention are described herein. 

A preferred agent for detecting Tango- 77 protein 
5 is an antibody capable of binding to Tango-77 protein, 
preferably an antibody with a detectable label. 
Antibodies can be polyclonal, or more preferably, 
monoclonal. An intact antibody, or a fragment thereof 
(e.g., Fab or F(ab') 2 ) can be used. The term "labeled", 

10 with regard to the probe or antibody, is intended to 
encompass direct labeling of the probe or antibody by 
coupling (i.e., physically linking) a detectable 
substance to the probe or antibody, as well as indirect 
labeling of the probe or antibody by reactivity with 

is another reagent that is directly labeled. Examples of 
indirect labeling include detection of a primary antibody 
using a f luorescently labeled secondary antibody and end- 
labeling of a DNA probe with biotin such that it can be 
detected with f luorescently labeled streptavidin. The 

20 term "biological sample" is intended to include tissues, 
cells and biological fluids isolated from a subject, as 
well as tissues, cells and fluids present within a 
subject. That is, the detection method of the invention 
can be used to detect Tango-77 mRNA, protein, or genomic 

25 DNA in a biological sample in vitro as well as in vivo. 
For example, in vitro techniques for detection of 
Tango-77 mRNA include Northern hybridizations and in situ 
hybridizations. In vitro techniques for detection of 
Tango-77 protein include enzyme linked immunosorbent 

30 assays (ELISAs) , Western blots, immunoprecipi tat ions and 
immunofluorescence. In vitro techniques for detection of 
Tango-77 genomic DNA include Southern hybridizations. 
Furthermore, in vivo techniques for detection of Tango-77 
protein include introducing into a subject a labeled 

25 anti -Tango- 77 antibody. For example, the antibody can be 
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labeled with a radioactive marker whose presence and 
location in a subject can be detected by standard imaging 
techniques . 

In one embodiment, the biological sample contains 

5 protein molecules from the test subject. Alternatively, 
the biological sample can contain mRNA molecules from the 
test subject or genomic DNA molecules from the test 
subject. A preferred biological sample is a peripheral 
blood leukocyte sample isolated by conventional means 

o from a subject. 

In another embodiment, the methods further involve 
obtaining a control biological sample from a control 
subject, contacting the control sample with a compound or 
agent capable of detecting Tango- 77 protein, mRNA, or 

5 genomic DNA, such that the presence of Tango- 77 protein, 
mRNA or genomic DNA is detected in the biological sample, 
and comparing the presence of Tango-77 protein, mRNA or 
genomic DNA in the control sample with the presence of 
Tango-77 protein, mRNA or genomic DNA in the test sample . 

o The invention also encompasses kits for detecting 

the presence of Tango-77 in a biological sample (a test 
sample) . Such kits can be used to determine if a subject 
is suffering from or is at increased risk of developing a 
disorder associated with aberrant expression of Tango-77 

5 (e.g., an immunological disorder). For example, the kit 
can comprise a labeled compound or agent capable of 
detecting Tango-77 protein or mRNA in a biological sample 
and means for determining the amount of Tango-77 in the 
sample (e.g., an anti-Tango-77 antibody or an 

o oligonucleotide probe which binds to DNA encoding 
Tango-77, e.g., SEQ ID NO:l or SEQ ID NO:3 or SEQ ID 
N0:6, or SEQ ID NO:10). Kits may also include 
instruction for observing that the tested subject is 
suffering from or is at risk of developing a disorder 

5 associated with aberrant expression of Tango-77 if the 
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amount of Tango- 77 protein or mRNA is above or below a 
normal level . 

For antibody-based kits, the kit may comprise, for 
example: (1) a first antibody (e.g., attached to a solid 

5 support) which binds to Tango- 77 protein; and, optionally 
(2) a second, different antibody which binds to Tango-77 
protein or the first antibody and is conjugated to a 
detectable agent . 

For oligonucleotide-based kits, the kit may 

o comprise, for example: (1) an oligonucleotide, e.g., a 
detectably labelled oligonucleotide, which hybridizes to 
a Tango-77 nucleic acid sequence or (2) a pair of primers 
useful for amplifying a Tango-77 nucleic acid molecule; 
The kit may also comprise, e.g., a buffering 

s agent, a preservative, or a protein stabilizing agent. 
The kit may also comprise components necessary for 
detecting the detectable agent (e.g., an enzyme or a 
substrate) . The kit may also contain a control sample or 
a series of control samples which can be assayed and 

o compared to the test sample contained. Each component of 
the kit is usually enclosed within an individual 
container and all of the various containers are within a 
single package along with instructions for observing 
whether the tested subject is suffering from or is at 

5 risk of developing a disorder associated with aberrant 
expression of Tango-77. 

2 . Prognostic Assays 

The methods described herein can furthermore be 
utilized as diagnostic or prognostic assays to identify 
o subjects having or at risk of developing a disease or 
disorder associated with aberrant Tango-77 expression or 
activity. For example, the assays described herein, such 
as the preceding diagnostic assays or the following 
assays, can be utilized to identify a subject having or 
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at risk of developing a disorder associated with aberrant 
expression or activity. Thus, the present invention 
provides a method in which a test sample is obtained from 
a subject and Tango-77 protein or nucleic acid (e.g., 
5 mRNA, genomic DNA) is detected, wherein the presence of 
Tango-77 protein or nucleic acid is diagnostic for a 
subject having or at risk of developing a disease or 
disorder associated with aberrant Tango-77 expression or 
activity. As used herein, a "test sample" refers to a 

10 biological sample obtained from a subject of interest. 
For example, a test sample can be a biological fluid 
(e.g., serum), cell sample, or tissue. 

Furthermore, the prognostic assays described 
herein can be used to determine whether a subject can be 

is administered an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small 
molecule, or other drug candidate) to treat a disease or 
disorder associated with aberrant Tango-77 expression or 
activity. For example, such methods can be used to 

20 determine whether a subject can be effectively treated 
with a specific agent or class of agents (e.g., agents of 
a type which decrease Tango-77 activity) . Thus, the 
present invention provides methods for determining 
whether a subject can be effectively treated with an 

25 agent for a disorder associated with aberrant Tango-77 
expression or activity in which a test sample is obtained 
and Tango-77 protein or nucleic acid is detected (e.g., 
wherein the presence of Tango-77 protein or nucleic acid 
is diagnostic for a subject that can be administered the 

30 agent to treat a disorder associated with aberrant 
Tango-77 expression or activity) . 

The methods of the invention can also be used to 
detect genetic lesions or mutations in a Tango-77 gene, 
thereby determining if a subject with the lesioned gene 

35 is at risk for a disorder characterized by aberrant 
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inflammation. In preferred embodiments, the methods 
include detecting, in a sample of cells from the subject, 
the presence or absence of a genetic lesion or mutation 
characterized by at least one of an alteration affecting 
5 the integrity of a gene encoding a Tango-77-protein, or 
the mis -express ion of the Tango-77 gene. For example, 
such genetic lesions or mutations can be detected by 
ascertaining the existence of at least one of: 1) a 
deletion of one or more nucleotides from a Tango-77 gene; 
10 2) an addition of one or more nucleotides to a Tango-77 
gene; 3) a substitution of one or more nucleotides of a 
Tango-77 gene; 4) a chromosomal rearrangement of a 
Tango-77 gene; 5) an alteration in the level of a 
messenger RNA transcript of a Tango-77 gene; 6) an 
15 aberrant modification of a Tango-77 gene, such as of the 
methylation pattern of the genomic DNA; 7) the presence 
of a non-wild type splicing pattern of a messenger RNA 
transcript of a Tango-77 gene; 8) a non-wild type level 
of a Tango-77-protein; 9) an allelic loss of a Tango-77 
20 gene, and 10) an inappropriate post-translational 

modification of a Tango-77-protein. As described herein, 
there are a large number of assay techniques known in the 
art which can be used for detecting lesions or mutations 
in a Tango-77 gene. A preferred biological sample is a 
25 peripheral blood leukocyte sample isolated by 
conventional means from a subject. 

In certain embodiments, detection of the lesion 
involves the use of a probe/primer in a polymerase chain 
reaction (PCR) (see, e.g., U.S. Patent Nos. 4,683,195 and 
30 4,683,202), such as anchor PCR or RACE PCR, or, 

alternatively, in a ligation chain reaction (LCR) (see, 
e.g., Landegran et al . (1988) Science 241:1077-1080; and 
Nakazawa et al - (1994) Proc. Natl. Acad. Sci. USA 91:360- 
364) , the latter of which can be particularly useful for 
35 detecting point mutations in the Tango- 77 -gene (see, 
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e.g., Abravaya et al . (1995) Nucleic Acids Res. 23:675- 
682) . This method can include the steps of collecting a 
sample of cells from a patient, isolating nucleic acid 
(e.g., genomic, mRNA or both) from the cells of the 
5 sample, contacting the nucleic acid sample with one or 
more primers which specifically hybridize to a Tango-77 
gene under conditions such that hybridization and 
amplification of the Tango-77-gene (if present) occurs, 
and detecting the presence or absence of an amplification 

10 product, or detecting the size of the amplification 

product and comparing the length to a control sample. It 
is anticipated that PCR and/or LCR may be desirable to 
use as a preliminary amplification step in conjunction 
with any of the techniques used for detecting mutations 

is described herein. 

Alternative amplification methods include: self 
sustained sequence replication (Guatelli et al . (1990) 
Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional 
amplification system (Kwoh, et al . (1989) Proc. Natl. 

20 Acad. Sci. USA 86:1173-1177), Q-Beta Replicase (Lizardi 
et al. (1988) Bio/Technology 6:1197), or any other 
nucleic acid amplification method, followed by the 
detection of the amplified molecules using techniques 
well known to those of skill in the art. These detection 

25 schemes are especially useful for the detection of 

nucleic acid molecules if such molecules are present in 
very low numbers. 

In an alternative embodiment, mutations in a 
Tango-77 gene from a sample cell can be identified by 

30 alterations in restriction enzyme cleavage patterns. For 
example, sample and control DNA is isolated, amplified 
(optionally) , digested with one or more restriction 
endonucleases, and fragment length sizes are determined 
by gel electrophoresis and compared. Differences in 

35 fragment length sizes between sample and control DNA 
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indicates mutations in the sample DNA. Moreover, the use 
of sequence specific ribozymes (see, e.g., U.S. Patent 
No. 5,498,531) can be used to score for the presence of 
specific mutations by development or loss of a ribozyme 

cleavage site. 

In other embodiments, genetic mutations in 
Tango- 77 can be identified by hybridizing a sample and 
control nucleic acids, e.g., DNA or RNA, to high density 
arrays containing hundreds or thousands of 
oligonucleotides probes (Cronin et al . (1996) Human 
Mutation 7:244-255; Kozal et al . (1996) Nature Medicine 
2:753-759). For example, genetic mutations in Tango-77 
can be identified in two-dimensional arrays containing 
light-generated DNA probes as described in Cronin et al . 
supra. Briefly, a first hybridization array of probes 
can be used to scan through long stretches of DNA in a 
sample and control to identify base changes between the 
sequences by making linear arrays of sequential 
overlapping probes. This step allows the identification 
o of point mutations. This step is followed by a second 
hybridization array that allows the characterization of 
specific mutations by using smaller, specialized probe 
arrays complementary to all variants or mutations 
detected. Each mutation array is composed of parallel 
5 probe sets, one complementary to the wild- type gene and 
the other complementary to the mutant gene. 

In yet another embodiment, any of a variety of 
sequencing reactions known in the art can be used to 
directly sequence the Tango-77 gene and detect mutations 
o by comparing the sequence of the sample Tango-77 with the 
corresponding wild-type (control) sequence. Examples of 
sequencing reactions include those based on techniques 
developed by Maxim and Gilbert ((1977) Proc. Natl. Acad. 
Sci. USA 74:560) or Sanger ((1977) Proc. Natl. Acad. 
5 Sci. USA 74:5463). It is also contemplated that any of a 
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variety of automated sequencing procedures can be 
utilized when performing the diagnostic assays ( (1995) 
Bio/Techniqxzes 19:448), including sequencing by mass 
spectrometry (see, e.g., PCT Publication No. WO 94/16101; 
5 Cohen et al . (1996) Adv. Chromatogr. 36:127-162; and 
Griffin et al . (1993) Appl. Biochem. Biotechnol. 38:147- 
159) . 

Other methods for detecting mutations in the 
Tango- 77 gene include methods in which protection from 

10 cleavage agents is used to detect mismatched bases in 
RNA/RNA or RNA/DNA heteroduplexes (Myers et al . (1985) 
Science 230:1242). In general, the technique of 
"mismatch cleavage" entails providing heteroduplexes 
formed by hybridizing (labeled) RNA or DNA containing the 

is wild-type Tango-77 sequence with potentially mutant RNA 
or DNA obtained from a tissue sample. The double- 
stranded duplexes are treated with an agent which cleaves 
single- stranded regions of the duplex such as which will 
exist due to basepair mismatches between the control and 

20 sample strands. RNA/DNA duplexes can be treated with 
RNase to digest mismatched regions, and DNA/DNA hybrids 
can be treated with SI nuclease to digest mismatched 
regions. In other embodiments, either DNA/DNA or RNA/DNA 
duplexes can be treated with hydroxylamine or osmium 

25 tetroxide and with piperidine in order to digest 

mismatched regions. After digestion of the mismatched 
regions, the resulting material is then separated by size 
on denaturing polyacryl amide gels to determine the site 
of mutation. See, e.g., Cotton et al. (1988) Proc. Natl. 

30 Acad. Sci. USA 85:4397; Saleeba et al. (1992) Methods 
Enzymol. 217:286-295. In a preferred embodiment, the 
control DNA or RNA can be labeled for detection. 

In still another embodiment, the mismatch cleavage 
reaction employs one or more proteins that recognize 

35 mismatched base pairs in double -stranded DNA (so called 
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"DNA mismatch repair" enzymes) in defined systems for 
detecting and mapping point mutations in Tango-77 cDNAs 
obtained from samples of cells. For example, the mutY 
enzyme of E. coli cleaves A at G/A mismatches and the 
5 thymidine DNA glycosylase from HeLa cells cleaves T at 
G/T mismatches (Hsu et al . (1994) Carcinogenesis 15:1657- 
1662) . According to an exemplary embodiment, a probe 
based on a Tango-77 sequence, e.g., a wild-type Tango-77 
sequence, is hybridized to a cDNA or other DNA product 

10 from a test cell(s) . The duplex is treated with a DNA 
mismatch repair enzyme, and the cleavage products, if 
any, can be detected from electrophoresis protocols or 
the like. See, e.g., U.S. Patent No. 5,459,039. 
In other embodiments, alterations in 

15 electrophoretic mobility will be used to identify 

mutations in Tango-77 genes. For example, single strand 
conformation polymorphism (SSCP) may be used to detect 
differences in electrophoretic mobility between mutant 
and wild type nucleic acids (Orita et al . (1989) Proc. 

20 Natl. Acad. Sci. USA 86:2766; see also Cotton (1993) 

Mutat. J?es. 285:125-144; Hayashi (1992) Genet Anal Tech 
Appl 9:73-79). Single- stranded DNA fragments of sample 
and control Tango-77 nucleic acids will be denatured and 
allowed to renature. The secondary structure of single- 

25 stranded nucleic acids varies according to sequence, and 
the resulting alteration in electrophoretic mobility 
enables the detection of even a single base change. The 
DNA fragments may be labeled or detected with labeled 
probes. The sensitivity of the assay may be enhanced by 

30 using RNA (rather than DNA) , in which the secondary 

structure is more sensitive to a change in sequence. In 
a preferred embodiment, the subject method utilizes 
heteroduplex analysis to separate double stranded 
heteroduplex molecules on the basis of changes in 
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electrophoretic mobility (Keen et al. (1991) Trends Genet 
7:5) . 

In yet another embodiment, the movement of mutant 
or wild-type fragments in polyacrylamide gels containing 
5 a gradient of denaturant is assayed using denaturing 
gradient gel electrophoresis (DGGE) (Myers et al . (1985) 
Nature 313:495) . When DGGE is used as the method of 
analysis, DNA will be modified to insure that it does not 
completely denature, for example by adding a GC clamp of 

10 approximately 40 bp of high-melting GC-rich DNA by PCR. 
In a further embodiment, a temperature gradient is used 
in place of a denaturing gradient to identify differences 
in the mobility of control and sample DNA (Rosenbaum and 
Reissner (1987) Biophys. Chem. 265:12753). 

15 Examples of other techniques for detecting point 

mutations include, but are not limited to, selective 
oligonucleotide hybridization, selective amplification, 
or selective primer extension. For example, 
oligonucleotide primers may be prepared in which the 

20 known mutation is placed centrally and then hybridized to 
target DNA under conditions which permit hybridization 
only if a perfect match is found (Saiki et al . (1986) 
Nature 324:163); Saiki et al . (1989) Proc. Natl. Acad. 
Sci. USA 86:6230). Such allele specific oligonucleotides 

25 are hybridized to PCR amplified target DNA or a number of 
different mutations when the oligonucleotides are 
attached to the hybridizing membrane and hybridized with 
labeled target DNA. 

Alternatively, allele specific amplification 

30 technology which depends on selective PCR amplification 
may be used in conjunction with the instant invention. 
Oligonucleotides used as primers for specific 
amplification may carry the mutation of interest in the 
center of the molecule (so that amplification depends on 

35 differential hybridization) (Gibbs et al . (1989) Nucleic 
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Acids Res. 17:2437-2448) or at the extreme 3' end of one 
primer where, under appropriate conditions, mismatch can 
prevent or reduce polymerase extension (Prossner (1993) 
Tibtech 11:238) . In addition, it may be desirable to 
5 introduce a novel restriction site in the region of the 
mutation to create cleavage -based detection (Gasparini et 
al- (1992) Mol. Cell Probes 6:1). It is anticipated that 
in certain embodiments amplification may also be 
performed using Taq ligase for amplification (Barany 

10 (1991) Proc. Natl. Acad. Sci USA 88:189). In such cases, 
ligation will occur only if there is a perfect match at 
the 3' end of the 5' sequence making it possible to 
detect the presence of a known mutation at a specific 
site by looking for the presence or absence of 

15 amplification. 

The methods described herein may be performed, for 
example, by utilizing pre-packaged diagnostic kits 
comprising at least one probe nucleic acid or antibody 
reagent described herein, which may be conveniently used, 

20 e.g., in clinical settings to diagnose patients 

exhibiting symptoms or family history of a disease or 
illness involving a Tango- 77 gene. 

Furthermore, any cell type or tissue, preferably 
peripheral blood leukocytes, in which Tango- 77 is 

25 expressed may be utilized in the prognostic assays 
described herein. 

3 . Pharmacoqenomics 

Agents, or modulators which have a stimulatory or 
30 inhibitory effect on Tango-77 activity (e.g., Tango-77 
gene expression) as identified by a screening assay 
described herein can be administered to individuals to 
treat (prophylactically or therapeutically) disorders 
(e.g., acute or chronic inflammation and asthma) 
35 associated with aberrant Tango-77 activity. In 
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conjunction with such treatment, the pharmacogenomics 
(i.e., the study of the relationship between an 
individual's genotype and that individual's response to a 
foreign compound or drug) of the individual may be 
5 considered. Differences in metabolism of therapeutics 
can lead to severe toxicity or therapeutic failure by 
altering the relation between dose and blood 
concentration of the pharmacologically active drug. Thus, 
the pharmacogenomics of the individual permits the 

10 selection of effective agents (e.g., drugs) for 
prophylactic or therapeutic treatments based on a 
consideration of the individual's genotype. Such 
pharmacogenomics can further be used to determine 
appropriate dosages and therapeutic regimens. 

is Accordingly, the activity of Tango- 77 protein, expression 
of Tango-77 nucleic acid, or mutation content of Tango- 77 
genes in an individual can be determined to thereby 
select appropriate agent (s) for therapeutic or 
prophylactic treatment of the individual. 

20 Pharmacogenomics deals with clinically significant 

hereditary variations in the response to drugs due to 
altered drug disposition and abnormal action in affected 
persons. See, e.g., Linder (1997) Clin. Chem. 
43 (2) :254-266. In general, two types of pharmacogenetic 

25 conditions can be differentiated. Genetic conditions 

transmitted as a single factor altering the way drugs act 
on the body are referred to as "altered drug action." 
Genetic conditions transmitted as single factors altering 
the way the body acts on drugs are referred to as 

30 "altered drug metabolism". These pharmacogenetic 
conditions can occur either as rare defects or as 
polymorphisms. For example, glucose -6 -phosphate 
dehydrogenase deficiency (G6PD) is a common inherited 
enzymopathy in which the main clinical complication is 

35 haemolysis after ingestion of oxidant drugs (anti- 
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malarials, sulfonamides, analgesics, nitrofurans) and 
consumption of fava beans. 

As an illustrative embodiment, the activity of 
drug metabolizing enzymes is a major determinant of both 
5 the intensity and duration of drug action. The discovery 
of genetic polymorphisms of drug metabolizing enzymes 
(e.g., N-acetyltransf erase 2 (NAT 2) and cytochrome P450 
enzymes CYP2D6 and CYP2C19) has provided an explanation 
as to why some patients do not obtain the expected drug 

10 effects or show exaggerated drug response and serious 
toxicity after taking the standard and safe dose of a 
drug. These polymorphisms are expressed in two 
phenotypes in the population, the extensive metabolizer 
(EM) and poor metabolizer (PM) . The prevalence of PM is 

15 different among different populations. For example, the 
gene coding for CYP2D6 is highly polymorphic and several 
mutations have been identified in PM, which all lead to 
the absence of functional CYP2D6 . Poor metabolizers of 
CYP2D6 and CYP2C19 quite frequently experience 

20 exaggerated drug response and side effects when they 
receive standard doses. If a metabolite is the active 
therapeutic moiety, PM shows no therapeutic response, as 
demonstrated for the analgesic effect of codeine mediated 
by its CYP2D6-f ormed metabolite morphine. The other 

25 extreme are the so called ultra-rapid metabolizers who do 
not respond to standard doses. Recently, the molecular 
basis of ultra-rapid metabolism has been identified to be 
due to CYP2D6 gene amplification. 

Thus, the activity of Tango- 77 protein, expression 

30 of Tango-77 nucleic acid, or mutation content of Tango-77 
genes in an individual can be determined to thereby 
select appropriate agent (s) for therapeutic or 
prophylactic treatment of the individual. In addition, 
pharmacogenetic studies can be used to apply genotyping 

35 of polymorphic alleles encoding drug -metabolizing enzymes 
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to the identification of an individual's drug 
responsiveness phenotype. This knowledge, when applied 
to dosing or drug selection, can avoid adverse reactions 
or therapeutic failure and thus enhance therapeutic or 
5 prophylactic efficiency when treating a subject with a 
Tango-77 modulator, such as a modulator identified by one 
of the exemplary screening assays described herein. 

4 . Monitoring of Effects During Clinical Trials 
Monitoring the influence of agents (e.g., drugs, 

10 compounds) on the expression or activity of Tango-77 

(e.g., the ability to modulate aberrant inflammation) can 
be applied not only in basic drug screening, but also in 
clinical trials. For example, the effectiveness of an 
agent, as determined by a screening assay as described 

is herein, to increase Tango-77 gene expression, increase 
protein levels, or upregulate Tango-77 activity, can be 
monitored in clinical trials of subjects exhibiting 
decreased Tango-77 gene expression, decreased protein 
levels, or downregulated Tango-77 activity. 

20 Alternatively, the effectiveness of an agent, as 

determined by a screening assay, to decrease Tango-77 
gene expression, decrease protein levels, or downregulate 
Tango-77 activity, can be monitored in clinical trials of 
subjects exhibiting increased Tango-77 gene expression, 

25 increased protein levels, or upregulated Tango-77 
activity. 

For example, and not by way of limitation, genes, 
including Tango-77, that are modulated in cells by 
treatment with an agent (e.g., compound, drug or small 
30 molecule) which modulates Tango-77 activity (e.g., as 
identified in a screening assay described herein) can be 
identified. Thus, to study the effect of agents on 
cellular proliferation disorders, for example, in a 
clinical trial, cells can be isolated and RNA prepared 
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and analyzed for the levels of expression of Tango- 77 and 
other genes implicated in the disorder. The levels of 
gene expression (i.e., a gene expression pattern) can be 
quantified by Northern blot analysis or RT-PCR, as 
5 described herein, or alternatively by measuring the 
amount of protein produced, by one of the methods as 
described herein, or by measuring the levels of activity 
of Tango-77 or other genes. In this way, the gene 
expression pattern can serve as a marker, indicative of 
10 the physiological response of the cells to the agent. 
Accordingly, this response state may be determined 
before, and at various points during, treatment of the 
individual with the agent . 

In a preferred embodiment, the present invention 
is provides a method for monitoring the effectiveness of 
treatment of a subject with an agent (e.g., an agonist, 
antagonist, peptidomimetic, protein, peptide, nucleic 
acid, small molecule, or other drug candidate identified 
by the screening assays described herein) comprising the 
20 steps of (i) obtaining a pre -administration sample from a 
subject prior to administration of the agent; (ii) 
detecting the level of expression of a Tango-77 protein, 
mRNA, or genomic DNA in the preadministration sample; 
(iii) obtaining one or more post-administration samples 
25 from the subject; (iv) detecting the level of expression 
or activity of the Tango-77 protein, mRNA, or genomic DNA 
in the post-administration samples; (v) comparing the 
level of expression or activity of the Tango-77 protein, 
mRNA, or genomic DNA in the pre-administration sample 
30 with the Tango-77 protein, mRNA, or genomic DNA in the 
post administration sample or samples; and (vi) altering 
the administration of the agent to the subject 
accordingly. For example, increased administration of 
the agent may be desirable to increase the expression or 
35 activity of Tango-77 to higher levels than detected, 
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i.e., to increase the effectiveness of the agent. 
Alternatively, decreased administration of the agent may 
be desirable to decrease expression or activity of 
Tango-77 to lower levels than detected, i.e., to decrease 
5 the effectiveness of the agent . 

C. Methods of Treatment 
The present invention provides for both 
prophylactic and therapeutic methods of treating a 
subject at risk of (or susceptible to) developing or 

10 having a disorder associated with aberrant Tango-77 
expression or activity. Alternatively, disorders 
associated with aberrant IL-1 production can be treated 
with Tango-77. Such disorders include acute and chronic 
inflammation, asthma, some classes of arthritis, 

is autoimmune diabetes, systemic lupus erythematosus and 
inflammatory bowel disease. 

1 . Prophylactic Methods 
In one aspect, the invention provides a method for 
preventing in a subject, a disease or condition 

20 associated with an aberrant Tango-77 expression or 

activity (or aberrant IL-1 expression or activity) , by 
administering to the subject an agent which modulates 
Tango-77 expression or at least one Tango-77 activity. 
Subjects at risk for a disease which is caused or 

25 contributed to by aberrant Tango-77 expression or 
activity can be identified by, for example, any or a 
combination of diagnostic or prognostic assays as 
described herein. Administration of a prophylactic agent 
can occur prior to the manifestation of symptoms 

30 characteristic of the Tango-77 aberrancy, such that a 
disease or disorder is prevented or, alternatively, 
delayed in its progression. Depending on the type of 
Tango-77 aberrancy, for example, a Tango-77 agonist or 
Tango-77 antagonist agent can be used for treating the 
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subject. The appropriate agent can be determined based 
on screening assays described herein. 

2 . Therapeutic Methods 

Another aspect of the invention pertains to 
5 methods of modulating Tango-77 expression or activity for 
therapeutic purposes. The modulatory method of the 
invention involves contacting a cell with an agent that 
modulates one or more of the activities of Tango-77 
protein activity associated with the cell. An agent that 

10 modulates Tango-77 protein activity can be an agent as 
described herein, such as a nucleic acid or a protein, a 
naturally-occurring cognate ligand of a Tango-77 protein, 
a peptide, a Tango-77 peptidomimetic, or other small 
molecule. In one embodiment, the agent stimulates one or 

is more of the biological activities of Tango-77 protein. 
Examples of such stimulatory agents include active 
Tango-77 protein and a nucleic acid molecule encoding 
Tango-77 that has been introduced into the cell. In 
another embodiment, the agent inhibits one or more of the 

20 biological activities of Tango-77 protein. Examples of 
such inhibitory agents include antisense Tango-77 nucleic 
acid molecules and anti -Tango- 77 antibodies. These 
modulatory methods can be performed in vitro (e.g., by 
culturing the cell with the agent) or, alternatively, in 

25 vivo (e.g, by administering the agent to a subject) . As 
such, the present invention provides methods of treating 
an individual afflicted with a disease or disorder 
characterized by aberrant expression or activity of a 
Tango-77 protein or nucleic acid molecule. In one 

30 embodiment, the method involves administering an agent 

(e.g., an agent identified by a screening assay described 
herein), or combination of agents that modulates (e.g., 
upregulates or downregulates) Tango-77 expression or 
activity. In another embodiment, the method involves 
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administering a Tango- 77 protein or nucleic acid molecule 
as therapy to compensate for reduced or aberrant Tango- 77 
expression or activity. 

Stimulation of Tango-77 activity is desirable in 
5 situations in which Tango-77 is abnormally downregulated 
and/or in which increased Tango-77 activity is likely to 
have a beneficial effect. Conversely, inhibition of 
Tango-77 activity is desirable in situations in which 
Tango-77 is abnormally upregulated and/or in which 

10 decreased Tango-77 activity is likely to have a 
beneficial effect . 

This invention is further illustrated by the 
following examples which should not be construed as 
limiting. The contents of all references, patents and 

15 published patent applications cited throughout this 
application are hereby incorporated by reference. 

EXAMPLES 

Example 1: Isolation and Characterization of Human 
Tango-77 cDNAs 

20 Cytokine genes IL-la, IL-10 and IL-lra have been 

found to be closely clustered on chromosome 2, i.e., 
IL-la, IL-ljS and IL-lra are located within 450 kb of each 
other. BAG clones containing IL-la and IL-ljS were used 
to identify other proximal unknown cytokine genes. To do 

25 this, a BAC clone containing IL-la and IL-ljS was selected 
from a BAC library (Research Genetics, Huntsville, 
Alabama) using specific primers designed against IL-lof 
and IL-10. The DNA from the BAC was extracted and used 
to make a random- sheared genomic library. From this BAC 

30 library, 4000 clones were selected for sequencing. The 
resulting genomic sequences were then assembled into 
contigs and used to screen proprietary and public data 
bases. One genomic contig was found to contain two 



WO 99/06426 



PCT/US98/16102 



- 94 - 

segments of sequences which resemble IL-lra. These two 
segments are potential exons of Tango-77 gene. 

Two PCR primers were then designed from the two 
potential exons and used to screen a panel of cDNA 
5 libraries for the expression of a Tango-77 message. A 
cDNA library from TNF-a treated human lung epithelia 
showed a positive band of the predicted size (i.e., if 
the two exons are spliced together) . Using the PCR 
fragment as a probe, a single cDNA clone was isolated 

10 from the same library. This cDNA contains an insert of 
989 bp. The cDNA clone contains three possible open 
reading frames. The first open reading frame encompasses 
534 nucleotides (nucleotides 356-889 of SEQ ID N0:1; SEQ 
ID NO: 3) and encodes a 178 amino acid protein (SEQ ID 

is NO: 2) . This protein may include a predicted signal 
sequence of about 63 amino acids (from amino, acid 1 to 
about amino acid 63 of SEQ ID N0:2 (SEQ ID N0:4)) and a 
predicted mature protein of about 115 amino acids (from 
about amino acid 64 to amino acid 178 of SEQ ID NO: 2 (SEQ 

20 ID NO:5) ) . 

The second putative nucleotide open reading frame 
encompasses 498 nucleotides (nucleotides 389-889 of SEQ 
ID NO:l; SEQ ID NO: 6) and encodes a 167 amino acid 
protein (SEQ ID NO:7). This protein includes a predicted 

25 signal sequence of about 52 amino acids (from amino acid 
1 to about amino acid 52 of SEQ ID NO:7 (SEQ ID NO:8)) 
and a predicted mature protein of about 115 amino acids 
(from about amino acid 53 to amino acid 167 of SEQ ID 
NO: 7 (SEQ ID NO: 9) ) . 

30 The third open reading frame (nucleotides 372-889 

of SEQ ID N0:1; SEQ ID NO: 10) encompasses 408 nucleotides 
and encodes a 136 amino acid protein (SEQ ID NO: 11) . 
This protein includes a predicted signal sequence of 
about 21 amino acids (from amino acid 1 to about amino 

35 acid 21 of SEQ ID NO: 11 (SEQ ID NO: 12)) and a predicted 
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mature protein of about 115 amino acids (from about amino 
acid 22 to amino acid 136 of SEQ ID NO: 11 (SEQ ID 
NO: 13) ) . 

Tango-77 is predicted to be 35% identical to human 
5 IL-lra at the amino acid level. 

Example 2: Expression of Tanao-77 mRNA in Human Tissues 

The expression of Tango-77 was analyzed using 
Northern blot hybridization. A PCR generated 989 bp 
Tango-77 product was radioactively labeled with 32 P-dCTP 

10 using the Prime-It kit (Stratagene; La Jolla, CA) 

according to the instructions of the supplier. Filters 
containing human mRNA (MTNI and MTNII: Clontech; Palo 
Alto, CA) were probed in ExpressHyb hybridization 
solution (Clontech) and washed at high stringency 

15 according to manufacturer's recommendations. 

Tango-77 mRNA was not detected in any unstimulated 
tissues (brain, liver, spleen, skeletal muscle, testis, 
pancreas, heart, kidney and peripheral blood leukocytes) 
mRNA on Clontech Northern blots. 

20 Over 96 cDNA libraries were then tested for the 

presence of Tango-77 using PCR amplification. Only three 
libraries displayed a positive signal. These libraries 
were the TNFof-treated bronchoepithelium, TNFa-treated SSC 
cell line and ant i-CD3- treated T cells. 

25 Example 3: Characterization of Tanao-77 Proteins 

In this example, the predicted amino acid sequence 
of human Tango-77 protein was compared to the amino acid 
sequence of known protein IL-lra. In addition, the 
molecular weight of the human Tango-77 proteins was 

30 predicted. 

The human Tango-77 cDNA (Figure 1; SEQ ID NO:l) 
isolated as described above encodes a 178 amino acid 
protein (Figure 1; SEQ ID NO: 2) or a 167 amino acid 
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protein (Figure 1; SEQ ID NO: 7) or a 136 amino acid 
protein (Figure 1; SEQ ID NO:ll) . The signal peptide 
prediction program SIGNALP Optimized Tool (Nielsen et al . 
(1997) Protein Engineering 10:1-6) predicted that 
5 Tango-77 includes a 63 amino acid signal peptide (amino 
acid 1 to about amino acid 63 of SEQ ID NO: 2 (SEQ ID 
NO:4)) preceding the 115 mature protein; or preceding the 
115 mature protein (about amino acid 52 to amino acid 167 
of SEQ ID NO:7 (SEQ ID NO:8)); or preceding the 115 
io mature protein (about amino acid 21 to amino acid 136 of 

SEQ ID NO: 11; SEQ ID NO:12). 

As shown in Figure 2, Tango-77 has a region of 
homology to IL-lra (SEQ ID NO:14). 

Mature Tango-77 has a predicted MW of about 13 kDa 
15 and the predicted MW for the immature Tango-77 is 19.6 
kDa, 18.5 kDa or 15.2 kDa, not including post- 
translational modifications. 

Example 4: Preparation of Tango-77 Proteins 

Recombinant Tango-77 can be produced in a variety 

20 of expression systems. For example, the mature Tango-77 
peptide can be expressed as a recombinant glutathione-S- 
transf erase (GST) fusion protein in E. coli and the 
fusion protein can be isolated and characterized. 
Specifically, as described above, Tango-77 can be fused 

25 to GST and this fusion protein can be expressed in E. 
coli strain PEB199. Expression of the GST-Tango-77 
fusion protein in PEB199 can be induced with IPTG. The 
recombinant fusion protein can be purified from crude 
bacterial lysates of the induced PEB199 strain by 

30 affinity chromatography on glutathione beads. 
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Example 5: Alternatively spliced forms of IL-lra and 

Tango- 77 

Computer program Procrustes (Gelf and et al . , 1996, 
Proc. Natl. Acad. Sci. USA, 93:9061-9066) is an alignment 
5 algorithm that predicts the presence of alternatively- 
spliced exons for a protein of interest in a stretch of 
genomic DNA. Using the IL-lra sequence, Proscustes was 
used to search for the presence of additional sequences 
that might encode for alternatively spliced forms of IL- 

10 Ira in the two overlapping BAC genomic sequences (see 
Fig. 3 and Fig. 4) . Potential sequences that encode 
variant exons for IL-lra were identified. These 
predicted exons aligned well with the N-terminal region 
of IL-lra, but were not present in Tango-77. The results 

is from Procrustes predicts the existence of more spliced 
forms of IL-lra. 

Furthermore, Procrustes also predicted an 
additional sequence in BAC1 and BAC2 that encodes an 
alternatively spliced exon for Tango-77 (T77-procrustes; 

20 Fig. 5) . This predicted splice variant form of Tango-77, 
T77-procrustes, was aligned with Tango-77 (Fig. 6) and 
with IL-lra and IL-ljS (Fig. 7) . 

PGR primers within this sequence can be used to 
generate a product that can be used to screen a panel of 

25 cDNA libraries using standard techniques. Suitable cDNA 
libraries include libraries made from TNFa-treated 
bronchoepithelium, TNFa-treated SSC cell line and anti- 
CD3-treated T cells. The resulting cDNA clone (s) can be 
isolated from the library and sequenced to identify 

30 additional Tango-77 cDNAs. 
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Equivalents 

Those skilled in the art will recognize, or be 
able to ascertain using no more than routine 
experimentation, many equivalents to the specific 
5 embodiments of the invention described herein. Such 
equivalents are intended to be encompassed by the 
following claims. 
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What is claimed is: 

1- An isolated nucleic acid molecule selected 
from the group consisting of: 

a) a nucleic acid molecule comprising a 
5 nucleotide sequence which is at least 45% identical to 
the nucleotide sequence of SEQ ID NO:!, SEQ ID NO: 3, SEQ 
ID NO: 6, SEQ ID NO: 10, the cDNA insert of the plasmid 
deposited with ATCC as Accession Number 98807, or a 
complement thereof; 
10 b) a nucleic acid molecule comprising a fragment 

of at least 300 nucleotides of the nucleotide sequence of 
SEQ ID NO:l, SEQ ID NO: 3, SEQ ID NO : 6 , SEQ ID NO: 10, the 

cDNA insert of the plasmid deposited with ATCC as 
Accession Number 98807, or a complement thereof; 

is c) nucleic acid molecule which encodes a 

polypeptide comprising the amino acid sequence of SEQ ID 
N0:2, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:8, 
SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, or 
an amino acid sequence encoded by the cDNA insert of the 

20 plasmid deposited with ATCC as Accession Number 98807; 

d) a nucleic acid molecule which encodes a 
fragment of a polypeptide comprising the amino acid 
sequence of SEQ ID N0:2, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID 
NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID N0:11, SEQ ID 

25 NO: 12, SEQ ID NO: 13, wherein the fragment comprises at 
least 15 contiguous amino acids of SEQ ID NO:2, SEQ ID 
NO:4, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, 
SEQ ID NO:ll, SEQ ID NO:12, SEQ ID NO:13, or the 
polypeptide encoded by the cDNA insert of the plasmid 

30 deposited with ATCC as Accession Number 98807; and 

e) a nucleic acid molecule which encodes a 
naturally occurring allelic variant of a polypeptide 
comprising the amino acid sequence of SEQ ID NO: 2, SEQ ID 
N0:4, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, 
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SEQ ID NO:ll, SEQ ID NO:12, SEQ ID NO:13, or an amino 
acid sequence encoded by the cDNA insert of the plasmid 
deposited with ATCC as Accession Number 98807, wherein 
the nucleic acid molecule hybridizes to a nucleic acid 
5 molecule comprising SEQ ID NO:l, SEQ ID NO: 3, SEQ ID 
NO: 6, SEQ ID NO: 10, or the complement thereof under 
stringent conditions . 

2 . The isolated nucleic acid molecule of claim 
1, which is selected from the group consisting of: 

10 a) a nucleic acid comprising the nucleotide 

sequence of SEQ ID NO:l, SEQ ID NO: 3, SEQ ID NO: 6, or SEQ 

ID NO: 10 or the cDNA insert of the plasmid deposited with 
ATCC as Accession Number 98807 , or a complement thereof; 
and 

is b) a nucleic acid molecule which encodes a 

polypeptide comprising the amino acid sequence of SEQ ID 

NO: 2, SEQ ID N0:4, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 8, 
SEQ ID NO:9, SEQ ID NO:ll, SEQ ID NO: 12, SEQ ID NO:13, or 
an amino acid sequence encoded by the cDNA insert of the 
20 plasmid deposited with ATCC as Accession Number 98807. 

3 . The nucleic acid molecule of claim 1 further 
comprising vector nucleic acid sequences. 

4 . The nucleic acid molecule of claim 1 further 
comprising nucleic acid sequences encoding a heterologous 

25 polypeptide. 

5 . A host cell containing the nucleic acid 
molecule of claim 1 . 

6. The host cell of claim 5 which is a mammalian 
host cell. 
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7. A non-human mammalian host cell containing 
the nucleic acid molecule of claim 1. 

8. An isolated polypeptide selected from the 
group consisting of: 

5 a) a fragment of a polypeptide comprising the 

amino acid sequence of SEQ ID NO:2, SEQ ID NO: 4, SEQ ID 
NO: 5, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID 
NO: 11, SEQ ID NO: 12 , SEQ ID NO: 13, wherein the fragment 
comprises at least 15 contiguous amino acids of SEQ ID 
10 NO: 2, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 8, 
SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO : 12 , or SEQ ID NO: 13. 

b) a naturally occurring allelic variant of a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO:2, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:8, 

15 SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, or 

an amino acid sequence encoded by the cDNA insert of the 
plasmid deposited with ATCC as Accession Number 98807, 
wherein the polypeptide is encoded by a nucleic acid 
molecule which hybridizes to a nucleic acid molecule 
20 comprising SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO: 6, SEQ ID 
NO: 10 or the complement thereof under stringent 
conditions; 

c) a polypeptide which is encoded by a nucleic 
acid molecule comprising a nucleotide sequence which is 

25 at least 55% identical to a nucleic acid comprising the 
nucleotide sequence of SEQ ID NO:l, SEQ ID NO: 3, SEQ ID 
NO: 6, or SEQ ID NO: 10. 

9. The isolated polypeptide of claim 8 
comprising the amino acid sequence of SEQ ID NO: 2, SEQ ID 

30 NO: 4, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, 
SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, or an amino 
acid sequence encoded by the cDNA insert of the plasmid 
deposited with ATCC as Accession Number 98807. 
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10. The polypeptide of claim 8 further comprising 
heterologous amino acid sequences. 

11. An antibody which selectively binds to a 
polypeptide of claim 8. 

5 12 . A method for producing a polypeptide selected 

from the group consisting of: 

a) a polypeptide comprising the amino acid 
sequence of SEQ ID N0:2, SEQ ID NO:4, SEQ ID NO: 5, SEQ ID 
NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID 

10 NO: 12, SEQ ID NO: 13, or an amino acid sequence encoded by 
the cDNA insert of the plasmid deposited with ATCC as 
Accession Number 98807; 

b) a fragment of a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID 

15 NO: 5, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID 
NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, or an amino acid 
sequence encoded by the cDNA insert of the plasmid 
deposited with ATCC as Accession Number 98807, wherein 
the fragment comprises at least 15 contiguous amino acids 

20 of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 7, 

SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 11, SEQ ID NO:12, SEQ 
ID NO: 13, or an amino acid sequence encoded by the cDNA 
insert of the plasmid deposited with ATCC as Accession 
Number 98807; and 

25 c) a naturally occurring allelic variant of a 

polypeptide comprising the amino acid sequence of SEQ ID 
NO:2, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:8, 
SEQ ID NO:9, SEQ ID NO:ll, SEQ ID NO:12, SEQ ID NO:13, or 
an amino acid sequence encoded by the cDNA insert of the 

30 plasmid deposited with ATCC as Accession Number 98807, 
wherein the polypeptide is encoded by a nucleic acid 
molecule which hybridizes to a nucleic acid sequence of 
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SEQ ID NO:l, SEQ ID NO: 3, SEQ ID NO : 6 , or SEQ ID NO: 10 
under stringent conditions; 

comprising culturing the host cell of claim 5 
under conditions in which the nucleic acid molecule is 
5 expressed. 

13 . A method for detecting the presence of a 
polypeptide of claim 8 in a sample, comprising: 

a) contacting the sample with a compound which 
selectively binds to a polypeptide of claim 8; and 
10 b) determining whether the compound binds to the 

polypeptide in the sample. 

14. The method of claim 13, wherein the compound 
which binds to the polypeptide is an antibody. 

15. A kit comprising a compound which selectively 
is binds to a polypeptide of claim 8 and instructions for 

use . 

16. A method for detecting the presence of a 
nucleic acid molecule of claim 1 in a sample, comprising 
the steps of : 

20 a) contacting the sample with a nucleic acid 

probe or primer which selectively hybridizes to the 
nucleic acid molecule; and 



WO 99/06426 



PCTYUS98/16102 



- 104 - 

18. A kit comprising a compound which selectively 
hybridizes to a nucleic acid molecule of claim 1 and 
instructions for use. 



19. A method for identifying a compound which 
5 binds to a polypeptide of claim 8 comprising the steps 

of: 

a) contacting a polypeptide, or a cell 
expressing a polypeptide of claim 8 with a test compound; 
and 

io b) determining whether the polypeptide binds to 

the test compound, 

20. The method of claim 19, wherein the binding 
of the test compound to the polypeptide is detected by a 
method selected from the group consisting of: 

15 a) detection of binding by direct detecting of 

test compound/polypeptide binding; 

b) detection of binding using a competition 
binding assay; and 

c) detection of binding using an assay for 
20 Tango-77-mediated signal transduction. s 

21. A method for modulating the activity of a 
polypeptide of claim 8 comprising contacting a 
polypeptide or a cell expressing a polypeptide of claim 8 
with a compound which binds to the polypeptide in a 

25 sufficient concentration to modulate the activity of the 
polypeptide . 
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22. A method for identifying a compound which 
modulates the activity of a polypeptide of claim 8, 
comprising: 

a) contacting a polypeptide of claim 8 with a 
5 test compound; and 

b) determining the effect of the test compound 
on the activity of the polypeptide to thereby identify a 
compound which modulates the activity of the polypeptide. 
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3TCGACCCACGCGTCCGCAGACGTCTACCTGGGGGTCCCGTCTGCGCTCCCGGGATGGAAAACGCCCAGGGGAAACTTA 79 

GGCAGGCGAGCGGACGGGCACCTCCCGCGGGACGAACTCACTCGGTGGCCTCCTACTTCCCCGGCCGTGTTCCAACGCC 158 

TGAGAATAACGGGAACAGCGGTCGTACTCACCGACAGCGGCAGCAGCGGCCTCTCTCAATTGGGCAAAGCACTCCAGAC 237 

CTTTTSGAAGAGTGACACCAAAGGCAAGCACCTGCTTGGCAGGCCCCTtt^ 31S 

M S r V G 2 N S G V 10 

CATTCCATTTTCTGTTGAGTAATAAACTCAACGTTGAAA ATG TCC TTT GTG GGG GAG AAC TCA GGA GTG 385 

KMG5SOW3KDS?0CCL»SD?A 30 

AAA ATG GGG TCT GAG GAC TGG GAA AAA GAT GAA CCC CAG TGC TGC TTA GAA GAC CCG GCT 445 

3 3 PLEPGPSLPTMMFVHTXI 50 

GGA AGC CCC CTG GAA CCA GGC CCA AGC CTC CCC ACC ATG AAT TTT GTT CAC ACA AAG ATC 505 

fFALAS SLSSASAEXGSPCL 70 

TTC TTT GCA TTA GCC TCA TCC TTG AGC TCA GCC TCT GCG GAG AAA GGA AGT CCG ATT CTC 565 

^ 3 7 3XGSFC-YCDXDXGQ5H 90 

CTG GGG GTC TCT AAA GGG GAG TTT TGT CTC TAC TGT GAC AAG GAT AAA GGA CAA AGT CAT 625 

? s i q l :< x s :< l m :< i a a q :< s s a no 

CCA TCC CTT CAG CTG AAG AAG GAG AAA CTG ATG AAG CTG GCT GCC CAA AAG GAA TCA GCA 685 

?. ?. ? T I 7 Y R A 3 7 G 3 W N ■ M 1 E 5 A 130 

CGC C3G CCC TTC ATC TTT TAT AGG GCT CAG GTG GGC TCC TGG AAC ATG CTG GAG TCG GCG 745 

A HPGWF ICTSCNCN3PVGVT 150 

GCT CAC CCC GGA TGG TTC ATC TGC ACC TCC TGC AAT TGT AAT GAG CCT GTT GGG GTG ACA 805 

D XF2NR XH I S FS -FQPVCXAE 170 

GAT AAA TTT GAG AAC AGG AAA CAC ATT GAA TTT TCA TTT CAA CCA GTT TGC AAA GCT GAA 865 

M5P3SVSD* 179 

ATG AGC CCC AGT GAG GTC AGC GAT TAG 892 

GAAACTGCCCCATTGAACGCCTTCCTCGCTAATTTGAACTAATTGTATAAAAACACCAAACCTGCTCACTAAAAAAAAA 971 

AAAAAAAAGGGCGGCCGC 989 



Fig. 1 
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>Concigi 

GAAG7GAAGA7A7AA7G7A7AG7AG7AA7A7A7AA7G77AGG7GAA77AA 
AGGAAATAGAATATATTGGGGAGTAATTATGGG7GTAAAGAAATATAGTA 
GGGAAG7A777AGA777GAGAAAAAAAAAAAGGAA777AG7G7AGG7GAA 
NAA7AAAAGNANAAGG7TAAAAA77AAAAAAAAA77AAA7A7AAA7AAA7 
AAA7AAAAA7AAAAA7AAAA7AAAAAA777AAAAAA77AAAAAAA7ATAA 
AAAATAAAGAAATGGAAGTGGATTCTTAGAAAAAAAAGAAAGTAAGGTGA 
TA7GAGGAGATAGAGAGGATGTGGTGTGAGATGATTGGTTTAATTAGAAA 
ATAGGTTTTGAATAGAGTGGGAAAGTAGAGTTTTGGTAAATGTGGGGGGA 
AGAGGG7AATG7TGTTTGAGTGAAAGAAAAAATGGTATATTTTTATAAAA 
TAATGAGGAAAGTGTGTGAAAAAAAAATTATTGGGATTTGGGAAGGTGAT 
ATATAAAGTTG7GGAAAATTTGGGGGGTGGGGTTTATTTAGGATTAAAAA 
GTTATTTAAAGAATGAAAATGAATTTTTGTTTGTAATTTGGGGATAAGAA 
ATTAATG7TTAGAAAGAAAGGGAAAAAATTGAAGAAAAAAATTTAGATTT 
TGGAAATTTAAAAATATTGTGGGTGTAAATAGGAAGGATTTTTAAAGGTA 
ATTGTGGAAGGGATTTGTGTGGAAAATAATAGGGAGAAAAAATGGGG 

>Concig2 - 

GCATCTAACTGGAGCCTGCATTATTACAGATTTAGCATCACCAAAGTCTA 
AACAATTAGACTGACTAAGGCAGAACTGCCCTTATGACAGCAGACATAAG 
AAGGAAAAGGCCAAAACACTGTGTTAAAAATTATCCAAATGTGAGGAAAA 
GGCAAAGAGAGTAGGTGTGCCTTTTTAGTGTCTAAGC7GCCTGCCCAAGG 
GGCA7CTGA7GC7C7CAGGCAGGAG7CCACAAA7777T7777G7AAAAGA 
TCAGATAG7AAA7CTT77CAGCG7GAAGAGCA7GAGG7C7C7G7CACAAA 
TACTCAACCACCA77ACAACA7GAAAGCAGCCAACAGACAACACA7GACA 
AA7GAGTGTGGC7G7G77CCAG7AAA7C7TGA77ACAAAAACAGGCAAGA 
GGCCAGAGC7GACCCATGGGCCA7AG777GC7GACCCC77CTG7AAAGGA 
AAG7A77777G777GAC77GC7G777ACCA77GA77GAACACAAGGC7CT 
GTAAAG77AC77G77AAC77GCAGAAGA77GA7GAG7GGCAAG7AA777T 

TATTCACCAGAA7A7AAAA7TA7T7C7GT7CAG7AGAAAAGATAAACCAA 

CTGTGA7A77A7GG7CCTG 

>Contig3 

GGGGTG7C7G7C7ACCA7G7GC7CGCAG77C7G7AA7AAA7G77CTC7CA 

AGATCC77AAAA7C7C7TGGAAA77A7AAAAA7AT7GGAAAGAGAAGAAC 

AG7T777AAAA7A7A7A7A7ATA7A7A7A77777T77GAGA7GGAG7C7T 

GC7C7G7CG7CCAGGC7GGAG7GCAGTGGCGCAAAC77GG77CACCACAA 

CC7C7GCC7CCCGGG77CAAGCGA77C77C7GCCTCAGCC7CC7GAG7AG 

CTGGGAC7ACAGGCGCCCGCCACCACGCCCAGC7AA77777G7A77777A 

G7AGAGACGAGG7777AC7ATG77GGC7AGGC7GG7CTCAAAC7CC7GAC 

C77G7GA7C7GCCCGCC77GGCCTCCCAAAG7GCTGGGA77ACAGG7G7G 

AGCCAC7GCACC7GGCCAGTTT7T7AAA7A7A77T77AAAAACAC77GAA 

7AAGAG7CAG7G7AAACTAGAAG7TTAAAAATGCTTCACAGAACACCCAG 

GG7T7ACA77ACAAGATTCTCACAACAAACCTA7TG7AAAGGTGAG7AAG 

GCA7G77A77ACAGAGAAAAGTTTGGGAGCAAAAC7G7AAAAAA7TATAT 

7T77G77G7A77T7C7AAGAGAAAGAG7A7TG77A7G7TC7CC7AACC7C 

7G7TGA77AC7AC7T7AAGTGA7TTCC7TGAGAGCACA7GA7GA7CC 
>Concig4 

GCCG77CA7AGAAAAC7GAAAGCAA7AAGA7GACTAGG7AAGCA7GACAT 
77AAAAGG7A77CA7GGGACGTGG77ACAAAACCAAC7CACAAC7AAAAA 
G7C77AGGACCTC7CGC7GAC77AGGAGCC7GA7CCCAAC7C7GAGAA7G 
AC7CAG7G7GT7ACCCTG7GGC7AG7G7AGACCAA7GA7CCTG7CTCAGA 
GTCAC7AGCCAACAGCCCATA7CAAG7AC7TGAAAC77TGACTCAGAAAC 
C7CAG7G7CAGAACC777GACC7AGGAACCACC7G7AG7GG77AAC7GCA 
A77TGCACCCC7TAG7TCAGGGCTTTACAACACCGGGGGCGGGGAGGGGA 
AAGGCA7ANANC7GATGACCTAAAGGAAACCCA7TGCAGCAACGCTTTTG 
7G7TAAG7G7ACAAATAAGTGT7G77T7AGAA7CC7CCAGG7AA7GCCrT 
TG77A777AA7G7G7CTGAGACAA77CTGCACA7TAAAGAATA7AAAA7A 
T7ACC7TG7AA77CCAA777GAAA7G7G7AA77GACA77AGAC77C7A77 
7GAA7TTGAAA7G7C7AAAACAA7G7GG77AAG7T7G7AAAAGG7G7GTG 
AA7TT7GAG7C7GA7TTACTACAlTrTT Tm ' A ATT':TCrT rm T7TGG 
AG7777AGGGA77GC7TAGATGGC7AGAAAGA77T7A77CA7CAGA7777 
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TAAGTCTGCCTTGGCAGGCACTTGCAGx oTTTGAAAGAATCAGATA7ATC 

AAATTTGTAGTTTAAAATATTTAAGGGAACTCAATTAACTATGCTAGAAA 

AGAGAATTAAGTATTTAGGAGGATTTAATATGGTGTGAAAGTTGTGAAAA 

TCAAAATGGAGACACTAATGTTAAGAAAACCCTGATAAATGGAACCAGGG 

AAAGGCATGAAGATAGAGTTCTCACACTTGTATCCCTGATCATGAAAAAG 
A7C7GC 

>Cpntig5 

GGGTTTTTCCGCGTTTTTACCCGAAATCTTCAAGGGATGGGAAAAAGAAA 
ATTGCTAAAAAATCTCGGTTTTTTGGTTTTAACAGATATTTACACCNTGG 
ATCCCAT77ATTATGTTGTCCCCAAGGTTTTCGGTGGGTTCCCAATCAGT 
7AGCCCCCC7CCACAG7GAAAGCAC777AC777A7CACC77CACC7AAAG 
CA7AAAA7CCAGC7C77GAAAGC7GC7CC77G77AAC7GAA7A7A7CCAC 
A7CCCAAAAG7AA7GA7CCA7GC77CA7AA7C7GCCACGGA7GGA7GGA7 
GGA7GGA7GGA7GGA7GGA7GGA7GGA7GAA7GGA7GGA77GA777C77G 
GAGGA777G77GAA777GGGAAA77CCACGCCAGGACAGC7GGCCCAAAC 
7GCCCGCGACAA7C7GC7CGG7ACAAGGGGAGGG7CC7GGAQAGGG7GCG 
GCCCGAGCCCCAG777GGAAA7GCCAAC77GGC7C7GCAGCCGGGCC77A 
GCCAC77GGG7C7GGCG7CCC7CCA77A77AGCGCCA7GCCGGC7CGGGG 
7GC7GCCAAG7CCC7GAGAGCACAAGCC 

>Concig6 

CGCGC7CAAGAAAAGC7GAAG7G7GAA7G77C7G7C7ACC77CACAG7AA 
A7GC7AAGAGAA7GACCCAAGAGCAGAGGG7A7CAC7C7GC7ACGGAGGA 
T7GA77G7AAC7GGC7C7CC7GCC77AGCAAGAAA7GCCAGAACCA7GG7 
CA77CAAG77C77GACCAAAAAC7GCC77CA7GAGAA7CAAC77CCCCAA 
GAAAAAAAAAGCAGAAACAGGCAAAGC77CCAGCA7GG7AGG7AA7AC7G 
ACCC77CT7CCC7CC7TCC777GGAGA77CACACAG7AA7AA7GCA7AAA 
GC777GCCAA7GGAC7AAGCAC7GCCCAGGGG77777G7CA7GCC7GGAC 
7GAAA7GC7C77777GCG77A7CA7AGAA7CCCAG7GCAG7C7GAG7AGA 
C7C7AAGCAAAAGGGACA77777CAAAAAGGC777AAA77GC7AG7ACAA 
AGAAGGCAACAAAAC77GCG7AAC7G7GGACAGA77AAC7CAC77GG7G7 
777GGC7C77CAG7777CCC77GGC7GCGAAG7AC7CC7GAAGC777C7C 

TGCGGC7C77CC7GCAAGCAGGCAAGCAAAAAAACGAC7GAAC777A777 

CGAGA7 

>Contig7 

GAAGAGCCGC7AAC77GC7G7AG7GA7AAGGAA7GAAC7AAGGC7AGGGA 

CA7A77AACA7CCGC7GG7GG7GAC7C777AGCC7AGA7C77ACCCCAC7 

CC7GC7CC77CCA7A7GG77CGG7C7CAGGC7CAC7ACCGA7CAA7GGCG 

7AC7AAAAGCAC7AAC7A7AGAC7CCAACACG7C7G7CG7G7G777CACG 

ACAAGCCG7GGAG77AA7CCC7C7GACAG7AGC7CAGA7AAGGA7GGGC7 

A7CA7GGGCCCGGAAC7GGGGCA7GACGC7CG7CACCAACGCA7GAGC7C 

CCCAAG7A7GC7A7ACC7G7CCC7A7GAAGGGC77CCAAC7C7A7G7GCA 

G7CCCCA7G7GGAGAG7CAGG7A77GA77GA7CAAGCCAGGGG7G7GG7G 

AA7GGGGAGC77CC7ACAGGGG7AA7GA7AA77GAAA7GCACGG7GA7GG 

GGA7777CA7A77GG7C7CC7AAGGAGA7AACAGA77GGA7GCGGGG7CG 

A7A77CCAC7GCCCAGGG7G7G7ACCGAGGG7A7C7GCAGG7GGA7C7CC 

7CCCCACG777GA77AA7AC7CC7G7C77GGGAAGCA7AGACGGGCGGGG 

GAAA7GA7GAAGGG7GACCAC7CCCC 

>Contig8 

GGGAACGCAG7GC7C7G7ACGA7GGCC77GA77GCGAA77CC7GCAGGGG 
GGG 

>Contia9 

GGCAAGAGA777AA7A77CA77CCA7C77CA777GGAAGA7GAAAAA77G 
GGGACCAGAGAGGGGAGGGGAC7GGGCCAAG7777CAAAGAAAAG7CAG7 
AGGAA77G7GAA77CC7GGGGGCCGGGGCCCA77AG7GC7G7777GGA7C 
AG7AAA7GGAGA7G7GAG777CAACAG7AACAGGGACA7777AAAA77AA 
AA7GA777AACC777AGAAAA7G7CC7A7777G7AA7AA7GA7GGA77CA 
CAGGAAGG7ACAAAGAAA7G7CCAGAGAG77CN7GAGCCCCC77CAGCCA 
GC77C77CCAA7G77AACA7C77GCA77A77A7AG7ACAACA7CAAAAC7 
GGGAAA7C3A7A77GG7AC7G7CCAGA7AGC77AC7CAGA7777GCCAG7 
TA7AC77CCAC7CA777G7G7G7G7G7G7G7G7G7G7G7G7G7G7G7G7G 
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TGTGTGTAGCTCTATGC^TTTTATO 

AA7CACAA7AC77AAC7A7GCCC7CA7CACAAGAC7C7C7C77GC7A7GC 

TTTACAGCTGTATCCTCTTCATCTCCAAACCCTAAGCCCACCTCACCGCC 

TCCACCATCTCTAATCCCTGGCAACCACTATTCTGTGCTCCATCTCTGTA 

ATTAAT7GTGTTAATTAATGTTATACAAATGGAATCATGAAG7ATGTG7C 

CTTTGAGATTGGGCTGTTAATTTTTCACTCAGCACAATTTCCGTGAGTCT 

AATCCAACTTGTGTGTAGCAGTAATTCTTTCCTTATTATTGCTGAATAAT 

ATGCCATGGTATGGATGTATCACAGTGTGTCTAATCCTTTGCCCATTGAA 

AGGAATTTGGATAATTTCCAGGTTTTGGCTATTATGAATAAAGTGAACAT 

AAGACATGTGTGTACAAATTTTGGTGTGATCAAAAGTCTCATTTCTCTGG 

GATAAATGCCCGGTAATGAAATGGCTGGGTTGTGTGGG 
>ConciglO 

GCAAGAACACAGGCGCGTATTATAACCTTACTACCAAGACCTGAACCCAT 
ATAAAGGTTTATGCGTAACAATCATCATCCCTGTTCCAGAAGATTACACG 
TACGACCACGCCTGGCTCACCGACTCACGTGGGCCAGTACCAGAAATTCT 
CCCAAACAAACAGTCGTGTCTGAAAACAATCGCGGTGACCTC.CACGGTTA 
GAAAAGCCTGTTTTCAAGTCCTGGAATTGCCACATATTAGCTGGGTAACT 
TTGGGCATCACATTTACTCTCTCCGAATTTCAGATTGCAAAAACTCATTG 
GATTGTTTTGTGGATTGAAAGAAATAATGTAAATTTAGGCCGAGTGCTTT 
GACTTACGCCTGTAATCCTATCACTTTGGGAGGCCAAAGCAGGAGGGTCA 
CTTGAGC7CAGGAATTTGAGACCACCTCTGGCAACATAGTGAGATCCTGT 
C7C7ACAAAAAA77777777AAA77A7CCAGCA7GG7GG7ACACGCC7G7 
A77CCCAGC7AC7CAGGAGAC7GAGG7G7GAGGA77GC7AGAACC7GGGA 
GATCAAG7CAACAGTGAGCCGTGG7TGTGCCACTGCCCTCCAACCTCAGT 
GACAGAGGAAGACCC7G7C7CAAAAAAAAAAAAAAAAG7AG7AAG777AA 
AGAAC77AG7G7AGGCC7GGCA7A7AAA7GA7A77G77GA7G77GA7G77 
AGC77GAAGGCACA777A7AGGAG7AGGGA7777A7AACA77A7GAGCC7 
GAGAGCACA7A7AA7G77CCC 

>Contigll 

GG7C7AACA7GC7CCAAC7GAAGAAACCCCACAC77G7CCGGCAAGGAAA 

C7AC7ACAGA777CC7GACC7AC7G7GCAA77CGGGGCA7GCGACGGGAC 

7G7G777C7GGG7ACGC7G7C7CAGG77CG7C7GGGA7G7AAGAAT7CAA 

C77CAG7AG77C7C7CA7AGACGCCGACGAGAGGGGCG7C7C7777C7C7 

GA7GAA7C7GCCAGA7C77CCAC77CA7AGAG7C7AAA7CC7CCGA77CG 

A7C7AC7GGAGACCCCCACG77ACAAAAACG7C7AACG7CGG7GACAGC7 

CCCCACA7AGGGAAAGA7CACC7GAG7C7CAC7ACC7CACA77AG7GC7A 

TC7CCAGCCCCA7GC7A7C7ACGAGA7GG7CACGCGAGG777AAGGGG7C 

7CCGA77CCGG7GG7CCGA77CAGC7AA7CG7GGCCC7ACG7GAACGA7C 

AC7 C C7GC7CG7AACA7CGA7ACAGGG7CGCGC7GACAAA7GG7AC7ACG 

7AGG77C7CAGG7CAA7GCCGCG7CACGAA7GAGCC7AAC7ACCCCA7AA 

G7GCACG7AC7G7G77ACC777CC7G77CGGCCAAACC7GC7AC7G7A7G 
C7G7GC77G777 

>Concigl2 

AGGC7CCA7G7GC7C7AGCC7GA77A7C77T7CAAG7G7777A777GC7A 
A7C7A7AAGGCCCC777CG7AAAA7G77CAC7CA7777C7AA77AGA7A7 
7777777AA7G77GAG77TTGAGAG77C777AGA7A7777AGA7ACAAG7 
CCA77G7CAAA7A7G7GA777ACAAA7A7777C7C7CAA7C7G7AATT7A 
G7777CA7CC7C77AACAGGG7C7777GGAGAGCAAA7AA777GA7777C 
A7AAGG77CAAA77A77AA777777C77G7A7AG77CACAC77C7AG7G7 
7AAG7C7AAAAAC7G7GCC7TG7CA7AGG7ACCAAAGG7777C7CCAG77 
777777C7AGAAG777AGAG777CA7G7777ACA77GGAG7CCA7GA7CC 
A77G77AA77AA77777G7A7A7AGG7AGA7G777AGG777AGGG77777 
77AAAAAAAAA77ACA7A7G777AA77GC7CCAG77CCC777CA77GAAA 
AGGG7A7CC77CC7CCA77GAA77GCC777G7CAGAAA77AA77GGACA7 
A777G7G7GAG7CTA777C7GGGC7C777A7CA7G77AC7777AAAAAA7 
GCA7CAG77CC7CCACCAA7ACC7CATrG7C77GA77A77GCAGT , 7A7A7 
AG7AAGCC77AGCA77AGGAAAAG7G77777CC7GC777A77C777NTCA 
AAAAA77777GGA7A77C7AGGGCC777ACA7A7AAA7777AAAA7AAC7 
77G7C7A7G7C7AACCGAAAGCC77A7GAAGA7777GA7AAGAA77GCA7 
TA7GCC7A7ACA77AA777AAAAAGAAC7GA7G7C777A77CAG77GA77 
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CTG C TAAT CT ATGAACA1 ^GCATCTCT ^ . CAAAGCATTTAGTCTTT C.t'T * 
AATTTCTGTCATTAATTTTTTAAAATTTTCATCCTAAAGATTCTGTATAT 

g7777g77gaa777a7gc77aagca777cac777c77gg7aacaa77a7a 
aatgattt7gtgttttttattccactagttcattttcagtgtgtagaaaa 
gcaatgaatttttgtgtgttgatctttgttcc7acatcttgcaacattat 
tgaactcatttattagttctaggaggttttttcatttttcttgtagatac 
cttgagattttctatatagacagtcatgttgtctgcaaacaggcacagtt 
ttXtttcttccttttcaatctatatgcctttttttttttttttgccttat 
tgcagtggctagaacttctagcac7atgtcaaatagcattggtgaaagca 
gaca7cc77g77cc77g7c77agaggaaca777gg7c777aa7c77gga7 

TGCG 

>Concigl3 

GCGCCTCCTTTTCTCTTCCAAAATTTCTCTTGTCTAGTTATTTGTCCAGG 

GAAATTTGAAAGCTCACTTACTGTGCAAGTCAGCAGGAAACAACTGGGTC 

TGTGCACAGCACCTAGCAAAGTTCTGCTCTAGGAATTACACTTTGGCCCT 

GAGG7AGATTTCTACAAGAACCTTACCTTCTAAGCAGCACTGGGGTTCAT 

C77777CCCAG7CC7CAGAGCCCA7777CAC7CC7GAG77C7CCCCCACA 

AAGGACA7777CAACG77GAG777A77AC7CAACAGAAAA7GGAA7GAAG 

7CCAAGACC7AAGGAGA7AGAAAGGGGACCAG77A7GGCA7C77C7CACC 

CCAGGACACC77GC7GCA7G7C7C7AG7GC7GAACAGACCAC7GGCC77G 

C7C7G7AG777GAAA7GC7CGC7GCAACCAGAAAGGCACCAAGGGGCCAG 

ACCA7GC7C7CC7G7C7A7CACGCC77CAAAGCAGAA777CCCAAACC77 

GAG7CACAG7GC7AACACACGGGG7GCCA7AACA77777G77GA7777GG 

CA777TACAAAAA7AAAA7AAAAAAG77AAAAA7GCA77GC7C7A77C77 

GGGGC7GGCACAC7A77GCC777GGCCAAA7CCGG7CCC7GAC7G77777 

77AAA7AAAG7777A77GAAACACAACCA7GC7C77G7G7ACA7A77G7C 

TC77GGC7GC77CGAAGC7ACAA7A 

>Contigl4 

GTG77CGC77777AACAC77ACC7AAAA77AC7C7G7AA7CCA7GGA7CC 
77AA777A777AAAAAAC7AA7G77AA7GAG7AGC777A7777CC7CCCA 
7CTAA777AAGGCCCACAGAACACC77CAC77ACC7CAA7CC7C7CCCAA 
C7TACA7GC7777AA7G7CA7A7A7G77AA7ACCG7A7AC7777AAAAC7 
77C7AAAA7AGCA77A7777A7AGCA7GAG7G77CA777ACA77777GCA 
7A7A777AGAA7777C777GC7C77CG777C77C77C7A777A7GAC7CC 
CC7C7GGGA7CA7777CC77C7AC77GAAG7ACA7AG777AGAAC7GCAC 
7A77CAA7ACAG7AGCCAC7AGCCA7G7G7AGC7A77GAAG777AAAC7A 
AG7AAAA77GAG7AA7A77AAAAAC7CAG77CC77CA7C7CAC7AGCCAC 
A777CAAG7GC7CAGCAGCCACG7GCGAC7AA7GAC7AC7G7ACA7CAAA 
CA7A7AGAACA777CCA7CA7GGCAAAGAGC7C7A77GA7AG7G77CA7C 
CAGAG777C7G77CCAGGACCAAAC7GAGGG77GGGC7GC7A777C7CA7 
GGCCCAA7AACAAGA7GCAGA7GAGC7GGGGAGGAAGAGAG77777A777 
C7GCNACCA777ACCGGGAGAAGGCC7GGAAA7CA7CACCAGGCCAAC7C 
AAAA77A77ACG7777CCAQAGC77A7A7ACC77C7AAGC7A7A7G7C7A 
CG7G7AAG7G7GCA7TCACCTGAAGACG77AG7GA77AAC77C7777AA7 
C7G7AAC7AAGG7C7GAG7CCGGAAGA7C77CCCC7GGAGCC7CAG7AAA 
777AC77AA7C7AAA7GGGTCCAGG7GC7GGGG7AA77ACCC77ATC77G 
TCCCC7GC7AAA7CA7GGAGG777GGGGA77CC777AGAGCACCAA7AAA 
C77G777G7GGAGGCC7GGGGG777C77C7GACCCACAA7AAAAC77G77 
7AA7CC7AAA7GGG7CC7GT7AAGAA77CC77C777A77T7G7CA7A777 
7AAGGCCCAGAAAAGGCC7GGGCAAAAC7C77GA7GGGC7777G77ACA7 
7CCAGCC777G7A7AAGAACAC7GG77777AA7A777AAC77AACCA777 
AG7CAG7AC7GAAACAGT7G7TA7AGAGA7C7GCA77AG7GAGACC7GGC 
C7GCCACA777CC7TT7C7GAAGA7C77A7GG7AG7GA7CACC777G7GA 
AAGGAAAA7AAA7C77GGGACC7CAAAA7CAC7AAGCCAAAGAAAAAAG7 
CAAGC7GGGAAGAA7C7GACAC77AAA7CCAACAC7GC7AAC7CA77CA7 
C7CAC7CA77CA77CA7777A7777C777777C777C777777777 7 777 
777777GAAACGAAG7C77GC7C7G7CACCCAAGC7GGAG7GCAG7GGA7 
C7CAGG7CAC7GCAACC7CCACC7CCCGGG77CAAGCGA77C7CC7ACC7 
CAGAC7CC7GAG7AGC7GGAA77ACAGGCACC7GCCACCACGCC7GGC7A 
A77777A7A77777AG7AGAGACGGGG777CACCA7G77CA7CAGGC7GG 
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7CTCGAACTCCTGACCTCGTGATCCGG- -CCCCCTCGGCCTTGTTTGCI x 

GAGGTACTGTCTAAATGCTGGAACTGAAAATGGCAAGCAAGACATCCCTA 

CCCTTGAGGAAACTGTAATCTAGTCGGAAATACAGATGTCAACCAAGTCT 

CACACAAG AAN ATTGT ACAAAAC C C CTAGGA 

>Concigl5 

GGAAAAACCTATCACCGCCTCCTATGGAACTTAAAACAAAAAGAAAAGTA 
ACAAAGGAAATGAATATTTC^TTCTGGAAGAACATTGAAAAAGAACAGGA 
AGAAGAGAAAGCACAACTCGAACTGTCCACTAGAATTGACAACACTCTGA 
CAGAATGTCTGAACCTCATCGAAGGGGTAAGTGAAAAAAATAAGCTCCTC 
CAGCTTTGGCCCAAAGTCTTATAATTTTTAAACATATTCCTAAATATAAT 
ATAGGAGAGATAGCCTTCATCTAAGTAGAAATTTAGCTACTCTTGTAAAT 
ACAGAGTAATAATAATAATGACATGCCCATAAACAGTGTCTTTTGTGTAT 
CTGTGCTTTTATAAGCACTTAGCTAAGATTATCTCACATAATTATCATAA 
CCACTGTTACTATGACCACTTTACAAACAAAACTGAGGCACAAAGAAGTT 
GGAAAACTAATCCAAACAAACTGGCTCCAAAAGGAACTTTGCTTTCTTTG 
GGTATCAAGTTCTGAAGAGTACACATTTAACATTGAAACTGAGGTCAGAA 
GGCAAGTTTCTATGTAAAGTTGGAGTATTCTGAATACTCTGGGTAGCTAC 
AAATAGTATTTAAATTTTATCTTGGATTCTGCAGATAAGGATAAAATAGA 
TGGTAGGCAAAGAGTATGATCCTTAGGAGAAATTTTTCCTGAAGGAAAAA 
TATATTAATAAAAAATGATGGAATAAACTTCTAAGATCCTTGCCTAGAGC 
AAAACTCATTCAGTCCTTTGGCTGGTAATGTTGAACATCAACAAAAAAAA 
GGAAAAGTTC^GTTTAAGTCTACTCCAGGCAACATTTTCACAACATCCAG 
TTAAATATTAACTATTTCTCTTTGTGGAATTGAACTAGAG7TCTTTTTCT 
TATCCTCTTTTTTGGTTGTTGTATTATTTAAAAATGAGTACCTTTTTATT 
ATTGAAATCATTTCAAGTAATGCAGATAAATGATCAGCCCTCTCCCTGTA 
CAAACATACATACTTAGGCATCCCAAACTTCTCTCTGGAGGTGACCACCA 
TTGCCAGTCATTCATTCTGTTTTCATGCATGTCCATACAGTATAGGTATG 
TCGAGA AATGAAGTATTATATTTTTGTGAGTTGCAATTCTTTTATTCACA 
TTTTTGTGTACTTTGGTTGTCTTTTCTTGTGTTTTCCTAGTACCAATGTT 
ATGCTGACTTAGGCAGATGAGTTGAGTATTTTCCTTTTTGCCCTATAAAC 
TGAAAATAGTTTGTATGACATGAGAATTATTTTTATTTTTTGAAGGTTTG 

ATAAAAACTTGCCCATAAAAATCGTCTGGACCGGTTTCTTGAGGATGCC7 

GTGTTAGAGCC 

>Contigl6 

CGCTTTAACCTGGGCTACCAATGGTTCGTCAAGTTCTAGATTCTCTATTA 

ATACCTTTTTCTTGTGTCTTTCTCTGGTCTGTTTTCAGCCCCGAGTCTCT 

TAGATCTGTCCTCTAATATTCCTATTGACTTTACTTCATTTTCTAAGTCT 

rTATCCTTTTGCTTTACTTTCCGAGAGACCTGCTTAACCTTATCTCCCAA 

CTCTTTTATTGAATTTCATTTCTTTTACTATATATTTTTTACTTTGAATA 

CACCTCTCTCTTCCTCACATTTTCCCCCATAGTATTTTGTCTTCAATTGA 

CAGTTCTACTATCTTATTACTCTGGAGATATTAATAATAGTTTTTAAATT 

TTTATTTATTTTTATTTTCAAAACAGTGTCTTACTCTGTCACTCACGCTG 

GAGTGCAGTGGTGTGATCATGGATCACTGCAGCCTTGATCTCTGAGCTCA 

AGCTATCCTCCTGCTTCAGCCTCCCAAGTAGCTGGAACCACAGGCATGTG 

TCACCATACCCAGCTAATTTTTTTGTTTTTGAGGTGGAGTCTCACTCTGT 

AGCCCGGTCTGGAGTGCAGTGGTGCAATCTGGGCTCACAGCAACCTCTGC 

CTCCTGGGTCCTGGTTCAAGCAATTCTCCTGCCTCAGCCTCCTGAGTAGC 

TGGGATTACAGAAACACACTACCATGCCCAGCTAATTTTTGTATTTTTGT 

AGAGACAGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCT 

TGTGATCTGCCCACCTTGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAG 

CCACTGCACCCGGCCACTAATTTTTAAATTGTTAATAAAGACGAGGTCTT 

GCTATGTTGCCCAGTATGGTCTTGAACTCCTGGGCTTAAGTAATCCTCCT 

GCCTCAGCCTCCCAAAGTGTTGGGATTACAGGTGTGAGCCACTGAATCTG 
ACAITTI^AAAAGTTTTC^ 

GCTTTT1TGGGTTGTTTTATTTTGATCTCTATCTTGCTAGAAACTTTCTG 
CAGACGTTTAGTAATACTAGATTTTTGAGAGTGGGCAACTGGAAAGCTGA 
TTGGAAACTCTGAATACATGGGTGAGGCTTGTTGGCTGTGAGTGTCATTG 
CTTGATGTCCTGGCAAGGCCAATGGGTTTGGGACCCCTACTATTAGTATA 
GGCCTGATTCCCTGGGAAAGGCTCTTTTGATCTCCTGCCTGGAGGATAAA 
GGCCTGGCTACCAGCCTTCTGTGTGTAATGTGAGGGAGAAGGGCTGGAGT 
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ATTCAACATCATGCTGAAxCCTTTCAA. aATCATCTTGTTTTTAGTAA^U 
TCCTACCTTAACTCTCTGTCTTCTGCTAGTATGGGAAAGATGACCTGAAA 
ATCTAACCATTTATTTTTCCCCCATTAATATCATTTTATGATTATTCAGA 
AGTTAAATAATTGTCATGCTGTCCTCCAAAAAGACTGAATCAACTAGCAA 
GAAATAAGAATTTTCTCACAGCTCTGCCAGCATTTTAAAAGAATAGCTTT 
ATTGAGCCCAGGAGGTCAAGGCTGCAGTGAGCTGTGATTACACCACTCTA 
CCfCAGCCTGGGTGACAGAGCAAAACCCTGTCTCAAAAAAGAAATTTAAG 
GAACAGC7TTATTGTTGTAAAATAGACATACAATAAACAGAGCACATATT 
TAAAT7GTGCAACTTATACTTTGATATAACCCTGTGAAAACATCACCACA 
ATCAAGATAGTGAATATATTTATCACCTCCTGATACAGTTTAGCTCTGTG 
TCCCCACCTAAGTCTCATGTTGAATTGTAATCCCCAATGCTGGGGGAGGG 
GCTTTGTGGGAGGTGATTGAATTGTGGGGGTGCACTTCCCCCTTGCTGTT 
CTTGAGATAGTGAATGAGCTCTCATGAGCTCCCCTTCACTCACTCTCTTT 
CCTGCTGCCATGTGAGGATGTGCTTGCCTCTTCTTTGCCCTTCTGCCATG 
ATGTGTTTCCTGAGTCCTCCCTAACCATGCCTCCTGTACAGCTTGCAGAA 
CTGTGAGTCAGTTAAATCTCTTTTCTTCATAAATTACCCAGTCTCAGGTG 
GCTCTTTATAGCAGTGTGAAAAGGAACTAATATACCTCCTAAGTTACCTC 
AAGCTTGTTTTTAATTCCTTCTCCTCCCTTCCTTCATTGCCAAGCAAACA 
ACCACCTGTTTTCTGTCACTATAGATTAGTTTACATTTTGTGGGTTTTTT 
TTTTTTTTGAGACAAGGTCTGACTCTGTTGCACAGGAGCAGAGCAGCGTA 
TC 

>ContigI7 

CGCGTTATAGGAGATGCGAACTTAAGAAATGATGATAAGGAGACTTTATT 

AAATATAATTTTGAATTATTTTGCCATTACAGAAATTCTAATTATTTAAA 

ATTCTATTCATAATTTTTAATCACTGTACTTCCCAAGCTTAGCTTAGAAT 

CCTTCTGTGCTGAGGATTAATTTTAATTTGTCTTTTATAGGCCTTATCTA 

AAAT C C AAGAATAATTGC CAGAATC AAC CAC CTT CTAAAT CTGTAAGTAG 

AAATTAGTCTTTTTAAAAATATGCATTCATAAGTATGATTAGTAATAAAA 

ATAATAAAGATGTTAGCAACCTAAAGAACATGTATTTGAAAGGTATTTCT 

TACAGATATAAAAACAGTTTGGTTTAATAAGAGACAATCATTTTTTGAAA 

AGTATGACATTTTTTGAAAAGTAGTTTAGTTTTATTAACCAAGAAAAGCC 

TCAAGTGAACTTT AGTC CTCTTGATAGCTAACATTTATTGAATGCTTACT 

GTGTGCCTGATACTTTTCTGACTTGCATTACCTCACTGAGTCCTCACAAT 

CTTATGAGGCTACTATTAGTAGCCCCACTTTACAGATGAGCAAACTAAGT 

CACAGAAAGGTTAAATAGGTCGTATAGCTATTAAGTGACAAAGCTGAGAG 

CCTGTGATCTTAACCACTTTGGTATGCTGCCATGAAGTTAAATAGCTCAG 

TAGTCATTAAAAGAGAACATTTGCATTGAACCTTCCAAGCCACTTAACAA 

GTATATGCTTCCTAATCAATTTAATTTAGCTACATTAGATAGAATGGTAA 

AGGATCCTTAACTTAAAGTTTAAATGGAAGAAATTAGCCCTCTGAAAGAG 

GCACAGATTATTCATCTGCAATAAAAATCTCACCTTTAGTTTTTTAAAAC 

ATAGTTTTTATCTGTGTTCTGAAATGTAACTAAAACAGTGCTTCCTGAAG 

TGAAAAATTCTCACTGGTGAGAATTTTAATAAGTTTTAATGATTCACCZAA 

ATCACTTCAGTCATATTTCAGTCATATGCATATGCATATATAGACATATA 

AGTTTTTATCTGTGTTCTGAAATGTAACTAAAATAGTGCTTCCTGAAGTG 

AAAAATTCTCACTGGTGAGAATTTTAATAAGTTTTAATGATTCACCAAAT 

CACTTCAGTCATATTTCAGTCATATGCATATGCATATGTAGACATATATA 

TGTTGTATGTATACATGACATCATTAGACACTGTGAAGGATAGCAAAATG 

TATATAAGGCAAAATTTATGAACZAATGGTTTAACGTTTGGGAAGCACTGG 

GTTACACTTTTACTTTATGCAGATTGAACCAGTATAGTATGCAAGTCTTA 

AGGAAAAATCTACTGGAAAGGGCCCTCATTCAGACTTCCCAGAGGCTTCT 

CTGGAAGTTGACAATACTGACTTCAGTACATCAGCTCGTAAATGAGGATG 

ATACCTACCTTATCTGCTTTACACAGTTGTAAAAGTAAAAAGTGAACTCA 

GGAAGGGAATTACAGAATTTAGGAGAAACTAAAAGCACGATGTAAATAAT 

AGTCATCATTACAGTTATATAATGCTTGACAATTTATATAACACTTTCGA 

TACATGACAACAATAACTAACACCCAGACATGTTTATATACATTACCTCA 

CTCAGAACAACCATGTGAGGAAGTTGGCCATATGCTTTAATGTCCAAACC 

AGGACACTTTTGAGAGTAAAAAGCAGTACrCTTTGACCAACAGGCATAAA 

TCAAAACTATCTTGTGAAAACCGGGATATATGGCATCCTTCCTAGATAAT 

AGATACTTTTACTATTATTAATTTTGCTGTGAATCTAAACCTGCTCTAAA 

AAAGTTAATTTTAAAAAGTAATGAAGTACTGATACATGCTACAACATGGG 
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TAAATCTTGAAAACGTT^rGCTAAGTG* ^ AGAAGCCAGACAGAASShSSL* * 
ACATATTACATGATTCCATTTATATGACACATCTAAAATAGGCACATCTA 
T AG ACAT ACAG AG ACAGAAAGT AGACTAGCGGTTG C CAAG AACTG CAGGG 

AGCAGAAGA7GGGGAGTGACTGCCAATANGAAAACGCATTACGT 
>Concigl8 

TGAATCGCAATGATATGTGCCACTTTGCACTCTCTGTGACATATATAATT 

ATTTTTAATGCATTCATTTTTTTCTCAGAGTGCATTCGTTTGAAAACATA 

GA^CGGGAAATACTGGTAGTCTTCCTTGTCAGTTAGAAACACCCAAACAAT 

GAAAAATGAAAAAGTTGCACAAATAGTCTCTAAAAACAATGAAACTATTG 

C CTGAGGAATTGAAGTTTAAAAAGAAGCACATAAGCAACAACAAGGATAA 

TCCTAGAAAACCAGTTCTGCTGACTGGGTGATTTCACTTCTCTTTGCTTC 

CTCATCTGGATTGGCATATTCCTAATATCCCCTCCAGAACTATTTTCCCT 

GTTTGTACTAAACTGTGTATATCATCTGTGTTTGTACATAGACATTAATC 

TGCACTTGTGATCATGGTTTTAGAAATCATCAAGCCTAGGTCAGCACCTT 

TTAGC7TCCTGAGCAATGTGAAATACAACTTTATGAGGATCATCAAATAC 

GAATTCATCCTGAATGACGCCCTCAATCAAAGTATAATTCG&GCCAATGA 

TCAGTACCTCACGGCTGCTGCATTACATAATCTGGATGAAGCAGGTACAT 

TAAAATGGCACCAGACATTTCTGTCATCCTCCCCTCCTTTCATTTACTTA 

TTTATTTATTTCAATCTTTCTGCTTGCAAAAAACATACCTCTTCAGAGTT 

CTGGGTTGCACAATTCTTCCAGAATAGCTTGAAACACAGCACCCCCATAA 

AAATCCCAAGCCAGGGCAGAAGGTTCAACTAAATCTGGAAGTTCCACAAG 

AGAGAAGTTTCCTATCTTTGAGAGTAAAGGGTTGTGCACAAAGCTAGCTG 

ATGTACTACCTCTTTGGTTCTTTCAGACATTCTTACCCTCAATTTTAAAA 

CTGAGGAAACTGTCAGACATATTAAATGATTTACTCAGATTTACCCAGAA 

GCCAATGAAGAACAATCACTCTCCTTTAAAAAGTCTGTTGATCAAACTCA 

CAAGTAACACCAAACCAGGAAGATCTTTATTATCTCTGATAACATATTTG 

TGAGGCAAAACCTCCAATAAGCTACAAATATGGCTTAAAGGATGAAGTTT 

AGTGTCCAAAAACTTTTATCACACACATCCAATTTTCATGGCGGACATGT 

TTTAGTTTCAACAGTATACIATATTTTCAAAGGTCCAGAGAGGCAATTTTG 

CAATAAACAAGCAAGACTTTTTCTGATTGGATGCACTTCAGCTAACATGC 

TTTCAACTCTACATTTACAdAATTATTTTGTGTTCTATTTTTCTACTTAAT 

ATTATTTCTGCAATTTTCCCAATATTGACATCGTGTATGTATTTGCCATT 

TTTAATATCACTAGACAATTCAATCAGGTTGCTACGTTGGTCCCTTGGGT 

TTACTCTAAATAGCTTGATTGCAAATATCTTTGTATATATTATTGTTTTT 

TCTCCTATCTTGTAATTTCTTTGAGCACATCCCAAAGAGGAATGCCTAGA 

TCAATGGGCACAAATAATTTGACAGCTCTTATTAAACATTATTCTGTAAG 

TAAAAACTGAACTACTTTTCAGTATCACTAGCAACATATGAGTGTATCAG 

CTTCCTAAACCCCTCCATGTTAGGTCATTATGAACTTATGATCTAACAAA 

TTACAGGG7CTTATCCCACTAATGAAATTATAAGAGATTCAACACTTATT 

CAGCCCCGAAGGATTCATTCAACGTAGAAAATTCTAAGAACATTAACCAA 

GTATTTACCTGCCTAGTGAGTGTGGAAGACATTGTGAAGGACACAAAGAT 

GTATAGAATTCCATTCCTGACTTCCAGGTATTTACACCATAGGTGGGGAC 

CTAACTAC^CACACACAC^CACACACAC^CACACACACACACACACACAC 

CATGCACACACAATCTACATCAACACTTGATTTTATACAAATACAATGAA 

TTTACTTTCTTTTTGGTTCTTCTCTTCACCAGTGAAATTTGACATGGGTG 

CTTATAAGTCATCAAAGGATGATGCTAAAATTACCGTGATTCTAAGAATC 

TCAAAAACTCAATTGTTTGTGACTGCGCAAGAAGAAAACCACCCATGCTG 

CTGAAAGTCAGTTGTCCTTTGTCTCCAACTTTACTTCCTTTACCTCTCAT 

ATG7TTGTGAATAAGCCCAATAAGCAGACNCCTCCTACAAAGTGAACCTG 

GTCTCTTTCCTCCTAACAGGG 

>Concigl9 

GTCTTGTAACACAGGTAAGACGAGTTCAAGTTTTATTTCTTGNTTTTAGA 
ACGGTAGTGAGCGGTTTTCAGCNTGAGACCACACCTAAGGTAAGTAGCTG 
AATTGGGGTTTTGTCTTGGCTAAAGTTTAACAACCAGCTGGTCTTAATTT 
CTCCTTACCATTAGAGCACTCAG TAATCA TATAAGTTGTGTGATCATTCA 
1T1TGCTTAACTGTTTGTTTCTGT1T1TATTGCTGTTTCAGTCT1TTTCC 
CATTGGGTTTGACCTACTCTATCTGACTTGATCAAATCCAAAGGAAATTT 
CCAAATTATGGGGAATGAGGCCTCTGAAGTGGCTAAATTCCCACCCTCCC 
ACACACACAAACGTGGTATGGTGGGGGAAAAAACGGCCAGCAAAAGAAAA 
AAAAAAAGGAAAAGATGTTTCATTTTGACCACCAAACGGGCTTTATTTAC 
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A7AACAAGGCCACC7TT. /GC7AGCCA jCCA7AC7GAAAGAGGH!HIGL 

TGTTGCCCCATGCTGTGGGTTCCATAGCTAACGTTCTGCCTTTTTTCCTA 

CCACGACAGCCTGGGTTTGGTTCCTAAATCAAGCCTTTTCTGGTTTGATA 

CTTGGTAATGCTGAAATAGCAGCAATTTGTCCTAGCTGAAATATCGTAAT 

AAGATTTTAAAAGATTTATTTTAAAGGACCTCAATAGTTAAAAGTCAGCT 

TAATTAAAAGCTAACATCCAAGATGTGTGCATGTGTATGTATGCGTCTTT 

GTATTTAAATAGCCCTCATGTTTTTTTTTTCTTTCCTAGGAACTTGCCTT 

TTTTTGAGCAAAAGTTTTTTTCTTCTCTGTTGACTGGATTCTGTTTTCTT 

CATTTACTTCTGCTGTCTCTCCTTTCTCTTGCACCGTCTGCTGCATGAGA 

GCCCTAAAATAGTTTATAATAGCCTGGGGTTCCTTAAAGAAAATGGAGAA 

GGTGCCAGGCTCCCTTTTAGGGAGAAACTTCTATTTTTCCTTATGGAATC 

CCTAGAGTG7AAACAGACAAGTTCATTTCAGCTCTTAAACTGCTTGCGTT 

TGTGTTG7GTTACCTGATTTTTTTGACTATTATATTTTTGACTAGCTATT 

GCAACAGAAGCTACTCTTGGGTTTTCAAGGAAGATTGTAGTTTAGACATG 

TAGAAATGTCTTTTAAAAAAAAAACAAACTTTTTTTTAAGTGCACTGTAA 

AAGCATCATATGGTCTAGCCTCCTAATAATTTTCCCTTTTTGGAGACCAG 

GATTCAGGGTGGGCTCTGCCCAGAGCTCAGAGATCCAGTTAAAAGAGAGG 

TAGTCTCGGCCGGGCGTAGAGGCCCAGCCTGTAATCCCAGCACTTTGGGA 

GGCCGAGGCGGGCGGATCACGAGGTCAGGAGATCGAGACCATCCTGGCCA 

ACATGGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCTGGGTGTG 

GTGGCAGGTGCCTGTAGTCCCAGCCACTCGGGAGACTGAGGAAAGAGGAG 

AATCGT7TGAACCCGGGAGGCGGAGCTTGCAGTGAGACGAGATGGCGCCA 

C7GCAC7CCAGCC7GGCGACAG7GAGAC7CCG7C7CAAAAAAAAAAAGA7 

AGG7AGACTCGATGTTGTCGTACCCGAGCAAG7TAGAGCAACGCCACACT 

77GAGACGAA77TAAGAG7CCTT7A7CAGCCGGCGACCAAGAGACGGC7A 

ACGC7CGAAA77C7CTCGGCCCC77GGAAGGGGC7TGA7777CC777A7G 

C777GG777AGGAAGGGGAGGGGAGC7CAG7TGCAACAA7TC7ACAGGAG 

7AAAAACA7GCAAAGAAATTAAAAAGACAAG7GG77ACAGGGAAACAAAC 

AGTTCCAGG7GCAGGGGCTC7AAATC7ATCA7AAGA7GTTAGGTA7GGGG 

GCTC7GCCGGACACAAACTCAAGGCT77A7GC7GT7A7CTCTTGAGCGAA 

ATCC7GGGAAC77CGTACATTGC77GC7TCAG7ACC77ATCAG7TAATCG 

GAC7C777GA7A7GT7GGGAGTCAGCG7ACACAAG77AAC7CCITGAGGA 

AGGGGG7GGGTAAGGAGTCCTTGA7G7C7GG7AAA7GAAGGAGCGAAA7C 

GAGTTCC7C7GGCT77CTCAGC7AAGGGAGAGC77A77CATG7GGAAACA 

AGGC7AAG7GATTAAGGGAGAAAGGGAGAGTCTGAAAACAAGGTTAGG7A 

7TACAA7G7CAA7AAAATTGGTC7CC77A7ACAG7CC7A7GG7AGA777C 

777CCA7C777AA7C7CCCTC7AGCACCACCAGAC7777TCTC7C7G7AC 

C77GAGA7G7AAA7777GCTA7C7GAA7777CG7C7AAGAG77G777CC7 

T7AA7A7GCAAATTTAGGG7TA7T7AGC7GACAAC7GCCAAAG7AG7GAA 

ACAAG77A7CAAGAACT7GAACG7C7AAGG7AGGAAAAAAAAAAG7C777 

A7GAA7C7A7AAGATGTACTTCTA77GGCA7GCC7AATACGTC7ATGTA7 

77ACG7G77GTGTACACAGTTTTTCAC7ACTGAAAATATATAGAGGAGT7 

C7AA77AA77GACTTAAGACAATAAAAGCGCT7GAATCAAATACCTTATC 

AGGAAAAAGGAAAAGACAAGTCAAATGCT7GTTCAAGTCTATA7AAC7TA 

AG7AAAA7C777AA7AAA7AAGC7AGC777AACA77A777GAAA7G7C77 

AAGAA77GCCAGCAGGTTCTGGGTTACAGAfiC7AGTGGGGGTGCAG7GGG 

GTGAGGG7TGGTGGGGTGGGNGGTNNNACNNNNNCNCCCCCCCCCCCCCC 

CCCCCCCCCCCCCTCCCCCCCCGCCCCGNGCGGGCCGCGCCCCCCCCCGC 

CCCCCCGGCCCGCCCCCCGCGGCCCCCCACCCCCCCCCCCCCCCCCCCGC 

GCCCCGCCCCCCCCCCCGCGCCCCCCACCCCCCCGCCCCCCCGCCCCCCC 

CCCCCCCCCCCACCCCCCACACCCGGCCCACACGCACCCCCCACCCCGAC 

GCCCCCGCCCCCCCCCCCCCGCAGCCGACGCCCCCCCCCCGCCCGCCCCG 

CCCCGCACCCCCGACCCCCCCCGCCGCCCCGCCCCCGCCCCCCCCCCCCG 

GCCCCCCCCCCGCCGGCGCGGCGCCCCACCCCCCCCCCCCAGCCCCGACC 

GCGCGCCCCCCCCACCCCCCCCCCAGCCCCCGCCCCCCGCCCCGACCC 

>Contig20 

GGCAG7ACGC7ATAA77CCC7C77CACC77ACC7CA7CTGT7C7C7GA7G 
GA7G7AC77777T7T7TAGT77C7AAA77CCC7777CC777GC7C7GGAG 
ATGGG7GATTGATGTAGTCTGGG7ATTTGT7CCC7CCAAA7CTCA7GT7G 
AAA7G7AA7CCCCAG7GTTGGAGG7AGGGCC7GG7GGGAGG7G7T7GGA7 
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catgggggcagatccc . -a7gaa7agc .'gg7ac7g7cc7c7caatag - j: 
aatgagttctcctgagatatggttgtttaaaagtgtgtggcactccccca' 
ttgctctcttgttactgctttcgacatgtgacatccctgctccccttcgc 
tctctgccatgattgaaagtttcctaaggcttcgccaaaagctgagcaga 
tgtggg7gccatgcttgtacagcctgcagaactgtgagccaaaataaact 
tcatttccatataaattacccagcctcagatatttctttatagcaacata 
agagtggcttaatacaggctgggcatggtggctcacgcctgtaatcccag 
cactgtgggaggctgaggggggtggaacatgaggtcaggagattgagacc 
accggctaacacggtgaaactccatctctactaaaaatacaaaaaattag 
tcgggcgtggtggtgggcgcctgtagtcccagctactctggaggctgagg 
caggagaatggcatgaacccgggaagcggagcttgcagtgagccgagatt 
gcaccactgcactccagcctgggcgacaagagtgaaactccatttaaaaa 
g aaaaaacaaaat tt caaacag aacaaaatg aaaaaaatac caagtgaaa 

ggcccctataaaaacccctctggggcccatcctcccaccccctcaagtga 
aaccacatttaacaatttggtgcatatctttccaaaccttttgttgtaca 

CATA7AAAAAACA7ACA7GCTTTGATTTGGC7CAGACTG7ACA7AGTGT7 
TTCCCTCTTGCATTTTACACTTAATATATCTTTGACATCT*rTC7ATGTCA 
GTGCATGTTGGCTCGATGATATTCTATCATTAAATACCCTTCCAAAAATG 
GTAAAATCATTTTAAAAAATCATTCACACAAGTACATATTTACAATTTTA 
AAAGAAAACAGAATCCCAAAACACAACGACAAACCTCTAAAAATAATCTC 
TATCTTTCCACCAGCATGGAACAGTTCATTCCTTTTTCACATAAAACGAA 
TTATGTGATTGGAAAGATTAACTCTAATCTACACATTTATATACAGAATG 
TTCTATTTGTTAAGCCTATCTGAAAATAAAAAATTCAGATGATTAATTCA 
CTTACACTTAGAAATTAAGTCAATATACTATGAATACACATTGTGATCAG 
TTATAATATGATGCTTCTTAGTCTAGGGTTTCAATTAAATAACAGTAAAA 
AAAATTGGATAAATAAGACAGCTAATAACTGAAAAATCCAGAAATTCAAA 
G ATTAT ATTG C CAACTAAAACACTGC CATTTACATTTTTTTTTC C7ACTT 

GGTAGCAAATGCTAATGGAATTCAATCCTGATTACTTAAAGTCAGTTCAC 
ATCACACATTCAATCAGGATAATACGAACATAATATGCCTACTATAGCGT 
TAGATTAAGACATAAAATTTTTTTGCTTGAAAGTAATGACTGCGTACCAC 
TTGAGACATTTGTCAACCACTTCAGCACATTGTTTACGAGTGACTGGATG 
TCCACAAGGAATAAAAACGACAGCAATATTTCTATCCATACAGATTTTGC 
AAAGCTTCTCCTCTTGCAGGTGTCTTAGCTGCTCTTCAGTACTAATCTCT 
TTCTGCAATGAAGTCTGACTTGATTCGTCTTGTGTACTGTCTTTCTGAGC 
CTTCACTGGATCTGCAATCAGAACCTCAAGTGATTTACAGTTGCTCCCAG 
ATGTCTGAATTTTTTCCTCCATTATTTTCTTAATGTCTTTGAAACTGAAC 
CCCATTCATATAGCTTCTTGTACCATAGGATTATGGAAGATGGTATCAAT 
7TTTCTAGT7AGTGATGGCGTTTTTTCAGCAGTTCTTACCAGACACTCCT 
CAAG7GAA7GGGA7AAA7GAA7A77G777A7A7A7777CG7G7C77C7G7 

7C7AACAGA7A777ACACCC7GGATGCCA77AACATG77G7CCCAAGGG7 

C7TNC7GGGC7 

>Concig21 

CTTTCTCCC77TTTACCCCCATTTTCGTAGGGATT7GGTTAAAACCCATG 
7AAAAAATCCAAACACCGGCGGGGAACGGGGGT7CAAGCTCGTATCCCCA 
CCAC77TGGGAACCCAAGGTGGCAGGATTGTCGGAAGCCAGGCATTTGAG 
CCCACCC77GGGAAAAAAAAGAGAACCCCCA 77777 777GAACAAAAACC 
CCAACCCTCCCAGGAAAGAAATAAGTATGGCTGGGTTGAAGTCACCAAAG 
ATGGCCGAC7GGCTGGTCAAGTAACTTTACCTGATGGTTCGTAGAATATT 
7ACC77CACCCAGGTGGGAGAATTGCTTGAGCCAACCCTCAGTGTGGATT 
CAGGAAC77GA777AA77GGTATCGTGATTGTGGATTAGATTCTCAGGGA 
TGCA7TCAC7AAGTAAAAGTGATAATAGCTACT77TAAGTAAAATAATGA 
ATGAA7CAAACAC7CTAAATCCATGG7GCTATGC7AAGCTCT7TCTGTAT 
7TTA7CTCA7TTGATATTACAAATATTTGATGTG7TAATAG7AATGACTA 
TCTCCATTTTTACAAGTAAGGAAACTGACATTGAGAGATTAAAAGACTAG 
CACAAA7CACAAAG7AAA7GAGA777GAA7CCGG7C77GA77CCAAAC7C 
7ACAG7A77C7AAA77CAAGGAGAC7AAA77A7AAGA7GGAGAGCCAA77 
7TAC777ATAACAGGGT7AGAATGGCAGAAGAGACCTGACATTCACACCT 
CTAGCCAG7GCA7CATC77CCTG7AGGCAAA7ATGCAGGAAATCTATAAT 
AAGAACG7CCTTTGGTGAAGGCCAGGTGCAGGGGCTTACACT7GTAATTC 
CAGCAC777GGGAGG7CAAGGTGGGAGGG7CGC77GA7GACAGGAGT77G 
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AGAACAGCCTGGGCAACATAGTGAGAC^wTGTCTCTACAAAC^^ 

ACACAAAACAAC77CAAGAAAAC7CC777GG7A7GGA7CAGAACAAGA7G 

AATTATCTATCTGATCCAAATGCTTAATGACATTAAGCCACAGTCCACTC 

ACTGCCACAATAGAGATATACCTGCCAATGCCACTCAGGTAATCCCATCA 

AAAG7GG7AA7GAGG7C7GCAGCA7GAC77G77C77AG7GA7CCCAGCC7 

GAGACCTTGAGATTGCAGCATTTTATTCTACATATGCACAAAACATCTGT 

7G&AAAA7C77C7AAA77GA7GCAA7ACA77CG7A7CAAGAA7ACC7G7C 

TGTAATCTCCATAAACCCTCTCC7TTCTGTTTTAAAAAATAGTAACAGCA 

TTTCTCCTTACATGACAAAGAAATGACTTCACCATCTACGAAATAGTGAA 

TAGGAGCTGTGTGGAAGGAAATTAGCTCTACTTCTTGGTGGAGATGAGAA 

GGGAGTG7TCCTCTGAAAATCAAGGCTCTTGTCATGCTAGGAGCCAAAGT 

CGTTT77TAGAGTGTGGACAGTTGAGAAGATAAGACAGGGACCATCCACT 

CATGTTTTTCTTATTCCATAGGCCTCTCTCAATTGGGCAAAGCACTCCAG 

ACCTTT7GGAAGAGTGACACCAAAGGCAAGCACCTGCTTGGCAGGCCCCT 

CAGCT7CTACGCAAGTATAAGTGAGTATATAAAATGGGGGTACTTGTGCT 

G77GAG7ACC77A777CCAAA7GAGGCC7GCCGG7G7CCC7GTGGC7G7G 

AGAAGGCC7C7AC7GGA7AGG7GGAAG77G7G7G77C7CA7C7777C7AA 

CCC7GGA77GAC77GCCCAAAAGGAAGCCA77A77AACAC7A7AA7AAAA 

CCA7CC77AA7C7GGGAC7C7C77CA7GCAG7GG77C77AACCAG7GA7A 

AACA7GAGAG77AC7777GGAGC77AAAAAAA77AAGA7GC7CAAGG7C7 

ACCCAAAC7GAC7GAA7C7CCAGAGG7GAGGCCCAGGGA7G7A7AC7777 

GAGCCAGACC7CAG777ACCC7GCAGAGC7CA7AAGG77GCA7AACACCC 

777G7CAGCCAC7C7GA7GAAAAGAAAAA77GG7GAGGAA7AAG7777AG 

AGAAGAAGGAGCAAAGG7G77C77GGCCAG7GAGAGCCAA7GACAGGGAA 

A7GCAAACAA7G7A7CCACAAGAAAGG7AAA77ACCC7A7AGAGCA7777 

AGGA7AAA7GAACA7C7CA7GCC7AGGG77GAGAGAGGG7ACAAAAAAAA 

AAAAAAAAAAGACCAC7C7GGA7ACACAACGCGA7AAA7GGAA7AAAGAA 

777777CC77G7AAA77AAAAAAA7CC777G77AC7GAGG7A7AA777AA 

7C7A7777A7G7A7AG77CAA7GAGG7G77A7AGA7AA7AAA77777777 

G7AAA77A77A7A77G7CA7A7AC7CA7ACA77CA77777AAAAG7CAGA 

AATG7A7A7AACCA77AAAC77A7AAA7CA77CAG7CA77CAGAGA7A7A 

GA7ACACGAGCA7A7777A7A7CCACCACAA7AA77A77ACCA7C7CAAC 

AA77CCA7CACCCC7CAAA777CAAGCG7AGGGG77777AAA7G7CAAAG 

GAG7C7AC7CAG7GGGAAGAAAG77AAGGAAAAAACC777GGGGC777GG 

GC7CC77CCCCC7GGGG77AAAAAGGCAGGAAA77GGGC77ACCCCCCC7 

GAAA77GGGAAC7GAAA7777GGGAAG777AAAAAAAAAAAAA 

>Contia22 

TCAAGCAGCC77CC77CC77GGC77CCCAAA77G77GGGA77ACAGGCA7 
GAG7CAGGA77CC7GGC77AG777ACA7777C7AGAG7777G7A7AAA7G 
GAAACA7ACAGAA7G7A7777777GCGGAG7GGGGGAG7G777C7A777C 
777C777CCA7777CCCCCCCCCNCCCCCCCGAGACGGAG7C7CGC7C7G 
7C7G77GCCCAGGC7GGAG7GCAG7GG7GCGA7C7CGGC7CACCGCAAGC 
7CCACC7CCCGGG77CAAGCAA77C7CCTGCC7CAGCC7CC7GAG7AGC7 
GGGA77ACAGGCGCCCGCCACCAC^CC7GGC7AA77TTITTrG7A77777 
GG7AGAGACGGGGrr7CACCA7G7TAGCCAQGA7GGTC7CGATC7CC7GA 
CC7CG7GA7C7GCCCGCTTCGGCCTCCC7AAG7GC7GGGAT7ACAGGCG7 
GAGCCACCG7GCCCGGCCCAAG7G777C7A777C77AACCAGC777CA7G 
CAA7C777777TA7TT7ACCA7C7C7GTGA7CCCACTCCCAAAGG7AC7A 
GA7G7CGA77GG7CC77AGGA7CAGC7ACCA777GCCCAAC7GC777CCA 
GCC77CCAAAAA7777777C7777777C77AAAGA7AC7CC7G7G7GAGG 
C7CAGAAC7C77GAA77GC7AC7GCAAA7A7GAAC7CGG7GA7G7GAATG 
CCAGGGAA77GCC7GA77GA7CAAAGAAA7G7A7CCCC77C7CCC7CAC7 
C77GC7G7C77C7CA777G7T77CCCCA7CC77G7GGA77CG7GAA777A 
AA7A7CCC777AA7G77A7AA7A7777AA7GGCG777GGCGAAAAG7ACA 
GAA77AGG7GCAAGAG7GCATAGC7G77A7777777777GGCC7C7GAGA 
C7G77CA7A7A7GCAAGTTAT77AACAGAAAG77C7GCAG7GACC7GAGA 
7G7CAGGGGGG7C7GA7AGAG7ACG777GAAGGCAG77AC7GGAAAAAAA 
7AA7GCCA777C7GG777G7AC77CGG7AAG77CAGA7GACCCAA7A7A7 
7G777ACA7G7GGCA77CAG7AAAAAAG7AGC77CCCC7CCC777C77C7 
7CC7777C7CC777CC7GC77C7A7AAAGCA7C7GC77TGGGAAAC77C7 
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TAGGAGGAGAGCTTGCCf^JCCCGTGGL .^7GGAGAGG7C77GGMA'GA . i 

AAAAGAGATGCTCCCACTCAATGCAGGATGGTGTGGAGG7AAATGGGGAT 

ACG7C7GGCA7CAC7CAGGAATGGGCC7TCC7GGCAGGGAAAAAAAGGGA 

GGGGAAAGAGGAAGGGAATTCNNANATNAATTGCTGAATACGGGGATTCC 

ATGGCCTGGATCCAGGAAGAGAACTTTGGGAGGTGTGAACCTGGAAGGCA 

TCANCTGATGAGGAGCAGCCTGAACTCCGGGGAGGACCTGTTTTTGGTGG 

CCCGGAAAAAAA7GCCTTCCACACACAGGGAGGCCACCCGGC7GA7GGGC 

TdGGGGTTGGACGGACAGCCCTAGGACAGGCTTGGGAAACCAGGCTCAGG 

TAGGGCCTGCGAGGTTCTCGCTGCGTCTCTTTCCTTCCTGGTCTTAGAAA 

ATAGAATCCAAGGCCTCTTGAGAGTGGAAGGTGGGTTGGGAGGAGGGCAG 

ATGGGGCTTAGGCCCAGGACACCCGTAGAGCTACTGCCCAGCTGTCTCTC 

AGGGACTCTGCTGAGGTCACTCCAAGGATCATTCTTAGCCTTGCTAGACA 

GTACTGACAGAGGGAACCGTAGTATCGCACCCACTTCCTTCTCTTTCAAT 

GAAAG7TTAAAGGTCACCATTTCCTCTGGCAAAGGAAGTTCCACAAATAT 

TCCATTTCCGGTCTTAGAAACAGCAAGGTATCAAGCAATTGCAAACTTCC 

TGTGC7GGGGAATTCCCAAGGAAGTAGGGGCAGAGTTCTGGTGGAGACAA 

AGTGAATTCCGAGTGATTAGTCAGTAGCAGTAGCAGTAGCAGTAGCAGTA 

GCAGTAGCAGTAGCAGTAGCAGTAGCAGTAGCAGTAGCAGCAGCAGAACC 

AGAATTTCCCCGCACGTGTCTCAGGCTCTCATTTGCCAACTCAGTCTCTA 

AGTATTTTTATTGGCAGGAAAAATAAAATAGCTATGAGTGAAATAATTCA 

TTAGACCTGAGCCTCCATCAATTTTGTGTTTAAAGGCCTGACTCTCTTTA 

CCTTTCCCTGGGATGGAAGATGCAAATGTTCCTGATCTCACTGTCAAAAA 

AGAAGAAC C AGTGGGTAT ATTGTATG CTTGAGTT C C AG C CATTAGT CACA 

AGACA7AGAGATGACTGCCATGTGTGTAGACTTTCTATAGACTGTGTGCT 

AAACCCGACC7GCCAC7TCCAAGGAG7AGA7GAGGAA7G7CCA7GG77C7 

GGGGAGCCC7ACCCCAATTTGGGGCAGACA77CCAAAGC7CAT7T7C7G7 

GGAGGGGG77GA7GG77AAAGGAACGGC7GGGA777AC7C77C777C7AG 

GGCCAAGAAAATGACA7GC7GCC7CCA7G777AA7CA7CC77CCCCC7G7 

7AA7AAC7A7GGCT7TAAG7CCCCGG77AGGGCC77CC7CCAAAA77GGG 

GAAAAAAA7TCCCCTCCCCCCCTAAAAA777T7T777TAAAAAAACC7TT 

77T777GGGGG77GGGAAAAAAACCAAAAA7777777TCCCCAGGGG777 

TTTAAT77AAAT7TCTCCCCAAAAA7T7G77T777777TTCCGCGAAAAA 

AAGACCCCCCCAAAAAAAAAAAG7T777TGGCGGAAAAAAAAA7A7TTT7 

7TTG7G77AAGAAA7GGAGAAGAAGGGGGG777T7TT7TTCT7CTCCCCC 

CACCCGCCAAAGGAAAGGTTG7TCACAGA77G7T77G7G7C7CCCGCCCA 

7 

>Contig23 

A7G7GCC7GCGAAA7CA7CC77CCAGAAA7A777GCCCC777C7777G77 
A7AGAG7GGCAC7GCCCTA7A7GG7GACCAC77GCCACA7G7GGC7GT7G 
AACAC77GAAA77GGC7TG7CAGAA77GCAG7G7AAAG7G7AAAACACA7 
ACCAAA777CAAAGACA7GGCACA7AA7AAAAAA7G7AAAA7A7C7CA77 
AACAA7777TATAT7GACTGTG7AAG7AACA77TTGAATA7A77GGA77A 
AA7ACA7GGATGATGCCCCAACACCCACAGTCCCTTATCAAGTCTC7ACT 
7CACA7777TGTACTTCTGACTTAGAAATAGCACTGGCGTCTAAGAGCCT 
A77AA7G7CG7CAA7AGG77C77GGGAACCACAA7777AAACAAAA7GAC 
A7A7AAGAAAACGAA7AACA77GAACAAAATGACA77A77CGAGGACC7G 
C7GCA7G77G7T7CACTTAAAG7CAG7G7CCAAGAAACTATCAGTGACAT 
77AG7GAGGAATTGCTGTCCTTCC7G777ACAGGAACCTGGGCAAGT7AC 
77AA77CC7C7AAGCCCGG7TTA7A7CCC7GCAAAGAGAGAAGGATAATA 
A7CACCAG7ACT7AGTGATGTCGTAAGGAGAAAA7AAAATAATAAATATG 
AAA7GGC7GACAGTG7CCTTGTCACACAGAAGATGTGTGA7CCACAGTAG 
C7GC7A77G7C7GCC7CAC7TCAC7AG7AA7GGTCCAGGGAGGCCTTTAA 
7GTGCA7GG7GCAG7ACA7TCACA7G77GGACA7GGG7GAAGGGAAAGAC 
CAGGC7CA7C7AAACACAA7AGGA7GC77G7GG7G7777GAGGAGGAA7C 
AAGGAC7AG77A7CCACAGC7G7AACA7GCA7GGA7CAAAAGAGA7AAGG 
CACACAAAAGACTTTGTCAGTAGCAAAGCA7TACAAAATGCAGAGACCAG 
C7G7GGG7GG7GG7GAGTCAGACCCAGC77CCC7C7G7GCC7GGC7GAG7 
GG77C7GGGCAAG7CACGCCA7C7G7C77GA7GCCCT7CCCCA7CTATAG 
AGAGGGAGCAACTGAGGCCCC77CCAA7AC7GAAG7CCT7TA77TC7GCT 
AC777AGAAA7A7CCACATTT77GGTAAA77CAAA7GATCCAA7GA77CC 

FIG. 3 (11 of 52) 

is/ur 



WO 99/06426 



PCT/US98/16102 
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AACACACACACACACACACAGAACCCATAAGCCT7CATCTTTCCTTGGAT 
AAACGAGCCTTCCTGTCTGGCCATTTAAGTCACGATTAAGTAAATGATTT 
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AATAAACTAAGATAAAL ^GAATAATTAAAACTTAATTTAAAAGAAGA^ 

GGAAAGGAAGCAGTTACATTAAGCAAAAGAGACATCTTC^^ 

AGTGTATGCCCTGGTGTCTGGATCCCATTTAGGAAACTTGGTAACCTTGC 

AATCTTGGGCAGATTGCTTAATTTCTCTAGACCATGACTTCCTCTTCTGT 

AAGATGTGATAAGAACATCTACCTCACAGGTTTCATGAGAGGATTAAATG 

AGATAATGTATTATAATCCCTTGAACATGGTAGGCTGTTATGTTAAGTCC 

TTTCCTCCTTCTCTGTAGCTATCATGGAATTTAAAAACACATTATAACTA 

GAGCATGAGTTGCGACTAAAGGCTCAATTGTCTCTGCATGTGTTGGCTCA 

TGCATGCTTTATTCCTCTGAAGAGCTTTTATACCAAGTGAAAGGAAATAA 

TTGCATTTCCCTGAAAATTCACAGGAAAAAGTTATGTTTTTCTCTTCATT 

CAAGTGATTCTGTTAGACCCAACCACATGCAACAATTTTAAAGTTGCTTC 

CAAATATATTTACAAATATTTCCTGTCTTCAAGGAACAATGGCZAAGACCA 

TGACTCAGGTTCACATCCGGATTCCACCACTAACCATGTACCCAATTACT 

TCAGTCACCTTCATTCAGGTCTTACATATCACAGAATAAAATCAGATTTC 

ATCAGAGGAGGTGAAGACAGGGAGAGGAGATATTTCAATCCCTTCTCCGC 

AAC C CC CGTTTTTTTTTTTTTTTTAACAAGGATCCTAGAGTTACTGAATG 

ATAGCACGTTTGAGGGGGAAAGACCCTAAGGATGATCTTTATAAGCCATC 

ACTTGGTGTTGGTGGTGATAAAAAACTCGAGTATCTTTATGCAGTGGAAA 

GAGAAGATTGGACTCGGAATCAGAAGCTTGAGTTCAAGCACTGGTTTCAT 

CAGTCTTGTGATCTTGGGTTGGTCACTTAACCTCTTCAAGGGTCCTCAGC 

TGTGAAAGAAGATAGTATCAGCTAATTCTTGTATGTGCAGTGAGG^GGCA 

GTGAGATAGTGCAGGTAAACTATAAAACAATTGTCACATGAAACGCATCA 

CAGTGATTCTTTGGACCCACAAGCTCCAATCTTATAAAACATATCCAGTC 

ACCCACCAACATAGATCATCTCACCTTGCATATCTGATTTTGTGGATCAT 

GGGGAAAAACTGCTGATTCCTAGCAAAACCCATGGCATAGGATAAGTGCA 

CAATAATTTTTTTTTCCTAAATGATTTAGATGACAGTGACTCATTAAGGG 

TTTCCTGAGGCCTCCTCAGAGTCGAGAGGTGGGTGCCTGAAGCCACCCAA 

AGTCCCTGTCACAGGATGGCTCCCAACGCACACACCACAGGCCTGCCCAG 

TATGTTCCACTATCTACCCAGTAGAGCCCTGCCCAGiTACGTTCCACTGTC 

CCTTCCCTAGAAGAGGTGACTGTTGTTCACAGTCCCAGAAAAGCGGGCTC 

CCCAAAACAATGCAAGGACCCACCTCTCTCTGAACCTCACCCACCCTAGT 

TTTCCTTTAAAAATCAATTTACAAGAAGATCATGTGAAGGAAAAGGTTGG 

GTGATATTCTAACCCAAGTTAGCTGTTTCTCAACCAAGTTCTCTTTGAAA 

AATTCAACAACCACCTTTGGGGAATTATTTACAACAGAGGAGTGAGGATG 

GGACCAGGATAGGTATTGCCTATGTTGGTGGAACCAGGGTTTTTTTCCTG 

GATTACCAAAGAGATGGTATGCATTGCTCCCAGAAGCTAAATATCTTCAG 

GCTTTCAATGGTGGCCTTCACCTGAAAATGTTATCCCTGTTGAAGCTTTC 

AAGCCAGTATTTTCATAAGAACTATATTTTCTTTGGTGAACTGAGGCATT 

ATAATGATGACTATACAGGTTCTTGAGTGACTGAAGCCATCATTAGCATT 

GTCATTATTTTTGTTTAGTTGCATCTCCATAGCAGCTCACATTCACAATG 

TGCTTTGCAATTGTTCCTTAGCAATAGCCCTCACAAGATTCTCAGGAGGA 

GAGGGTTAATCCGGATTAACATTTCTGTGAAGCCTAGCGAGATTAATCGC 

>Contig25 

AAGAGTTTTAAAATTAAGTAAGGACGCCGGGAAACAAATCAATCCCAGCA 
AACATTTTGTTGGGATTTATCATTCAAGCAATTTTACAGTTATCCCTGTC 
AAATACATTAAGTGTTCAAAATTGGGCATAGGGGGAACAAAATAATAAAC 
CCAGCCAAAACAGAATAATCCCTGTTTGTTCAATGTTGGATAAAAAAGAC 
ATTACTATTGGTGTAAGGAAATTAGATACATCTTCCATTATTTAGTAAAA 
TTACCATAACTTCTAACTTTGTGGCTTTAGGCAGTCTAGTCCACAGGCAG 
GAAGGAGGTTTGTTTTGGCAAATGACTGTTATCATCTTCTGTTTCAAAGC 
TAAACCATAAACTAAGTTCCTCCCAAAGTTAATTCAGCATATGCCCAGGA 
ATGAACAAGGACAGCCTGGACGTTAGAAGCAAAATGGAGTCAGGTAGGTC 
AGATCTTCTTCACTGTCTCAGTGATGGCAGTTTCATAACTTTAAATGATG 
GCTATCACAGTTTTCATAAATAATCTAGATAAACAGTTAAAATAAAATAA 
TTAGG7AAATGTAGTGCGATAAATATTAGTAGACAAACTCACCATAATTT 
AGAATCTAAAGTTAAATTAAATAATAATATTTCATTATTTGGTATTTTCC 
AAGAAAAACATATTGTAG GAAA CCATTCTTTTTAAAAAAAAAAGTGTCCT 
TTTAAAAAGGTGAATAATTTTTGTCTAATTCAAAGTTTATTGAAAAGTTA 
TGTATAAAACAAGGTAAAAGGAACAAGGAAATAAGGGAAATGTAAAGAAA 
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ATTATAGAAATAAAGTCjw rATTTTTTGGTAAGAAAGCTTAAAOi&GA^.* rt . 

ATTTTAGGTAAGAAAGAATCTTACCTAAAATTTTGTGCTAGAATAAAGTG 

ACTGGCTAAGAAAGGGATGTTCAAAGCTATTTATGACAAACCCACAGCCA 

ATATCATACTGAATGGGCAAAAGCTGGAAACATTCCCTTTGAGAACTGGC 

ACAAGACAAGGATGTCCTCTCTCACCACTCC7ATTCAACATAGTATCGGA 

AGTTCTGGCCAGGGCAATCAAGCAAGAGAAAGAAATAAAGGGTATTCAAA 

TAGGAAGAGAGGAAGTCAAATTTTCTCCGTTTGCAGATGCATGATTGCAT 

ATTTAGAAAACCCCATCATTTCAGCCCCAAAACTCCTTAAGCTGATAAGC 

AACTTCAGCAAAGTCTCAGGATACAAAATCAATGTGCAAAAATCACAGGC 

ATTCCTATACACCAATAATAGACTAACAGAGAGCCAAATCATGAGTGAAC 

T C CCATT CACAATTGCTACAAAGAG AATAAAAT AC CTGGGAATACAACTT 

ACAATGGACATGAAAGACCTTTTCAGGGTGAACTGCAAACCACTGCTCAA 

GGAAATAAGAGAGGAAACAAGCAAATGGAAAAACATTCCATGCTTATGGA 

TAGGAAGAATCAATATCGTGAAAATGGCCATACTGCCCAAGTAATTTATA 

GATTCAATGCTATCCCCATCAAGCTACCATTGACTTTCTTCACAGAATTA 

GAAAAAACTAATAGCCAAGACAATCCTAAGCAAAAAGAACAAAGCTGGAG 

GCATTGTGCTACCTGACTTCAAACTATACTACAAGGCTGCAGTAACCAAA 

ACAGCATGGTACTGGTACCAAAACAGATATATAGACCAAAAGAACAGAAC 

AGAGGCCTCAGATATAACACCACACATCTACAACCATCTGATCTTTGACA 

AACCTAACAAAAATAAGCAATGGGGAAAATAATTCCCTATTTAATAAATG 

ATGTTGGGAAAACTGGTTAGCCATATGCTGAAAACTGAAACTGGACCCCT 

TCCTTACAACTTATACAAAAATCAACTCAAGATGGATTAAAGATTTAAAC 

ATGGCTGGGCATGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCC 

GAGATGGGTGGATCATGAGGTCAGGAGATGGAGACCATCCTGACTAACAC 

AGTGAAACCCTGTC7CTACTAAAAAATACAAAAAATTAGCTGGGCATGGT 

GGTGGGCGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATGG 

TGTGAAACCAGGAGGTGGAGCTTGCAGGGAGTGGAGATCACGCCACTGCA 

CTCCAGCCTGGGCAACAGAGTAAGACTCCATCTCAAAAAAAAAAAAAAAA 

AAAAAAAG AAGGATTTAAACATAAG AC CTAAAAC CAT AAAAAC CATAGAA 

GAAAACCTAGGCAATACCATTCAGGACATAGGCATGAGCAAAGACTTCAT 

GATTAGAACACCAAAAGCAATTGCAACAAAAGCCAATTGACAAATGGGAT 

CTAATTAAACTGAAGAGCTTCTGCACAGCAAAAGAAACTATTGTCAGAGT 

GAACAGGCAACCTACAGAATAGGAGAAAATTTTTTCAATCTATCCATCTG 

ACAAAGGGCTAATATCCAGAATCTACAAGGAATTTAAACAAATTTGCAAG 

aaaaaaaaacccatcaa; AGTGGGCAAAAGATATGAACAGACACATCTC 

AGAAGAAGACATTTATGT GGCCAACAAACATGAAAAAAAGCTCATCATCA 
CTGGTCATTAGAGAAATGCAAATTGAAACCACAATGAGATACCATCTCAT 
GCCAGTTAGAATGGCGATTATTAAAAAGTCAGGAAACAACAGATGCTGGA 
GAGGATGTGGAGAAATAGGAATGCTTTTACACTGTTGGTGGGAGTGTCAG 
TTAGTTCAACCATTGTGGAAGACAGTGTGGCAATTCCTCAAGGATCTGGA 
ACCAGAAATACCATTTGACCCAGCAATCCCATTACTGGGTATATACCTAA 
AGGATTAGAAATCATTCTATTGTAAAGACACATGCACATGTATGTTTATT 
GCAGCACTATTCACAATAGCAAAGACTTGGGAACAACCCTAATGCCCACC 
AATGATAGACTGTGTAAAAAAATGTGGACGTATACCCCATGGAATACTAT 
GCAGC CATAAAAAAGAATGAGTTCATTCTTTTGCACGGAACTGGATGAAG 
CTGGAAGCCATCATTCTCAGCAAACTAACfiCAGGAACAGAAAACCAAACA 
CTGCATGTTCTCACTCATAAGTGGGAGTTGAACAATGAGAACACATGGAC 
ACAGGGAGGGGAATGTCACACACCAGGGCCTGTCAGGAGGTGGGGGGCAA 
GGGGAGGGATAACATTAGGAAAAATACCTAATATAGATGACGGGTTAATG 
GGTGCAGCAAACCACCATGGCACATGTACACCTACGTAATAAACCTCCAT 
GTTCTTCACATGTATCCCAGAACGTAAAGTAAAATTTAAAAAAGAAAGAA 
AGAAAGAAAAGGATGTTCACGACAAACCAGAAAGTCCAAGCATGTCATGA 
ATAGTCTGTGTAAGTCACAATAAGAGGATTTATTTAAAAAAACTTTTATA 
TGATAAAGTTGTCTATAATTAAAGGGAAATTATAATGGTCTTTCTAGAGA 
TTGGGTTGATGTTAAAAAACTACTTATATATTAAAAAATTGGTTAGAACA 
ATGAAATTTTCTTACGGGGTTGATTCACTCTTAATAAATTATAAGAGACT 
TAAGAATTTTTTTTTAACCCAAAGTTCAGCTTTTATTGCATCTTGCTGTT 
TTAGGTTTTCTCTCCCCTTTAAAAGGGTGGGAAATAGTAATGCCCTCCTT 
CAACTCCCT7CAGCTCATATACGTTTTTTACCCTCAGATTCTGTTTGTTG 
TGTCCTGATGCTAACAATGTTTTCTTAAAGGTCTAAAGGAAATGTTTTCT 
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TCCAACA7AA7A77C7G1 oCATTGCAGAAGGTCTTTTCTTTTGCCTTTTO • 

GTAACTGGC7TAACAGATTTTATGTTTTATTGAAATAATTTCTATGCCAT 

TATTATTAAGTTTTGGTTTGCTTAGAAAACACTGAGATTAATACAATTTT 

TTAAAAATTATGATTATTACATCCATATATCTTTATGTATGTGCTTTTAA 

AGTCCTTGTGACATTGAGTTCTAGGGCTTGACTCCTGGGTCTTAAAAGGA 

CAAGTCCTGCTAAATCTTAAATACTGACAGCAATTAAAGGCTCATCT7CA 

GGAC7GGTAGAAAATGCCAATCAAAATAAACTGCATTCTTGAAACACAGA 

GCCAGAAAT7AAAGCTATTCAACTCAAGGCCCAGGAACTATAGTGGAAGA 

GG7GGG7G7G7GAGA77G7AAGGGCCAA7777GAGAGA7AAAA7AAG77C 

AA777C7C7A7AAA77AA7CA7AA7CA77GA7G7CCAAGCCACAC7GA7G 

CAAGA7CAGCA7A7GGG7CC7G7G7CAGA77AACAAGG7777C77GAAGC 

A77AACC7AC7CC77AA7AAAGG77ATAGAGG77A7AAAAGGC77C7GGA 

AG77A7AGC7A7GG7CAAGA7AAAAA777CA7AGA77GT7AA7ACAA777 

7GGAAAACAAA777AA77GGC77C77GC7G77777A77AGGGC77A77G7 

77GGAAAA77AAG7C7CG7C7C7CAAAGAA7GAAGGC7T7CACC777777 

777777777777AA7CC77GAG77A7CAC777GG7CAAA7GAA7GAC77A 

T777ACAATGACC777CA7CAAG7G7777AAACC777CAAAT77GACAAA 

C777CCAAAA7CAAAC7ACAAA77A7G7C77777A7GACC7AATGAA7CC 

777AAAA7AC7AGG77CCC7AAAG7CCAAAAAAAAAAAAAAA7AACA7AA 

7G7GGC77A777GG7ATAAAAA7777ACAAGAAACA77G7CAAA7A7AAA 

A7A77G7G7GG7777G777GGGC7G7A777G7A7AAA7A7G77A77GG7A 

7G7G77CCAAAA77A7AGGAAAC7CC7A7AA77C7GA7A7GAC77GG7G7 

ACA77A7CAG7AA7AA77A7AA77G77A7GG7AAA77A77G7G7GCCA7G 

GAGG7AACAAA777CC7CA7CAAG7G7G7C777GAC7A7GG77GCCC7AA 

AAC77777GCCA77CACAGACAA77G7C77GC777GG7CC7C777AGAAG 

G7GG7777A7AA7CAGC7ATAAAAC7C7AACGGG7GC7C77GAA7GCAGG 

C77AAGA7AGC777GGAGAC7G7GACA7CAGAA7AGAGGAAAAAC777CA 

G7A77CA7GGAG7GC7GAAA7A77CA7GAA7A7CAAGCAAAACAGGAA77 

AAC77CA7AGA7GGAAC7AAAAGAA7GC7GAAG7AA7CT7777GAC7777 

7TTC77AGAA7G77GA7CC77CG7777G77777CAGAG7CNAGGAAA777 

77C7G77GAGA7A77GACAGC777AACAA77AAG7A7AC7CCAG7GAACA 
CAA777GGAGCA 

>Contig26 

A7C7AG7CA77CCCCAGCC7GACCAA77CAA7GGCCCCCA7C77AG77AA 
AA77CC7CACCC7GACAAGGCCCCA7C7ACGCC7C7GACC7CA7GCCC7C 
CAC7C7CAG7C77GCACTCACCC7GCCACAC7CAAGGGC77CCCCAGG77 
CC77C77AGA77CCACCGA7AGC7CAGGGAC777GCACA7GC7ACGG7C7 
C7GCC7GGC7CC7CCCCAGA7C77C7CA7GCC7AGC7GC77C7CA7CAGC 
ACCCC7CAGAGAC7G7CCC7GCCCCACC7C7CCAGG77CCA7ACC7GCCA 
CCC7CCCCCAA7CACG7AACAG777C77CACAGAGCGAG77ACCA7CCCA 
G7A777CCC7AAC77A777777G7GAC7GG7C7G77GCC7G7C7CCACCA 
CAAGAACA7AAGC7GCA7G7GAACAGGAGCC77G7C7A7C77G7CACCCC 
AG7GGC7G7GACA7AACC7GA7ACACA77AGA7GC7CAA7GA7G777GA7 
GAA7GAAG7GC7GG7AG7CCAAC7G7G777CCT7G7C7GTG7AAG7A7G7 
C7G77G7GG777CC7AAGAACCTACAGC7C7CCCAC7G7GAC7CC7G77C 
7A7GG7CC7GA777GCTGGAC7AGAA7CC7AACC7ACA7GC77AC7C7TA 
G7G7CC7CCCCCAGAGGC7GAA7CCCAG7CCC7AAACC7CCACCAAA7GG 
C7AAGACC7AGC77CCAACCAGACAGGCC7ACGCTGAGACC7CAGCACCG 
CCC77C7GCGG7C7CA7CCT7AACGCA7CC77CAGGGCCCAGC77AAA7G 
7C7C77C7CCAAGGAAGGCTA7CC7C777C7GCCCC7CAG7GC7C7CCA7 
GCC7CC7C7A7GCC7CCA7GCC7GC777CAACCC7GCAGAAG7GGAGAAA 
T7GC7AA7C7GC7G7G77GACAC7G7GC7GGGG7GCCT7GGGCCAGGGAG 
CAGGC7GG7GG7G7GC7GA7AGCCCG7GGC7G7GCCCAGG7CCA7GC7CA 
C77CC7GAGCCCCAG7GGAG7AGGC7CCC777CCC77A77GCAGCAC7CA 
GAGGAAGGACG7GC77C77AGGACAGA7C7GGCCAACC7C7CCC7CG7GA 
GAGAAGGCCCAGCCA7CC7C77GCCC7C777C777C7CC7GCCCCCGAG7 
AA7AAAGG7GCC7GG7CAGAGCC77C7AGAAGGAGACCCAAACA7CCACC 
ACACA77CCCAG77CCAACCG7CA7CCACA7GGC7GGC7G7GCAGG7AAA 
CGCAGAG7C7G777CACACACCCAACCA7C7AG7A77GGA7GGGAGGACA 
G7AGCG7GACAC7C77C7CCAGCC77GAGCCC7AC7GTGGGCCCCACCCA 
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AGTCATGTACAAAGTTAGCCCTTTGTTATATAGAGTTAGCTACGTACATC 

77CC7C7G7AGGGAACCCAAGAGGGGAGAAGAGA7A7G7AG7AGGA7T7A 

ACCTGCAAATCCTCTGCTGAGCACCCTGCACTACATACAGTGGGTAGCAT 

GTGGTAGGTGCTCAATAACTATTGACCGATAGATTGAATACAGGTAGGAT 

GGTGACACAATCTAAGATCC CAGGGGTGGGGAGACCACACGCTTGGTTAG 

GGAGACCCAAAGTGGACCGTGTGGCCAGAAGAG7CCCGCACTGCACTCTA 

GTGACAGTGCAGAAAGTCACTGTGGGAAATCTAGAAGTTTCTACAGGTTG 

CTATTTCATCATAGCACTGTGCAGGCCAACCCTTCCTGCTCCACTGGCTG 

TTGGGAAAAGCTTTCTCTTTTCTTCCTAGCCAGGGAGCTCTCAAAGTGTT 

CCACTCTCTCACCTCCACCCAGGCGTCCAGGTGTGGAGGACACTTGCCGG 

CTGCTTGTCTGCTGACTCATCCCTTGGTTTCACTTGGAAAACCTACCACC 

AGCTGGCCTCTTTCCAAGCATCAGCCTCCTCATTTTCTTAATCCCTTAGG 

TGTGATCTCACCTCCACACAGTAGATTGCCTCAAGGCCCAATTCCAATAT 

GAATAAAAATGATTATTTTGTCATCTTCCAATCTTCCTTTTAAAATATTA 

TTTTATAATTCCCTTTAGGAGGATCACCTAAGTGAAGACTATTTTTACCT 

AAGAAATGTTAAAATGTAAAGACATGGTTGTAATCTGGGGATTCCTGTTA 

AAATGGCTAGCAGACAGAAGTCAGACGACAGGCTAGAAATGTGTGAAGAG 

TGGTTGCCTTTGAAAGGCGGAGTTGGTAATGATTTTCTTCCATTTTTCCA 

TGCTTTCCAATTCTCTACAAAGGCCTTAATATTACTTCGATAACCAGGAC 

CTCTGATAACCTGCCCCCACCGAGTAAAGACTTAGCTGGGAAAGTCAGC7 

TCATG7GAGGTAAAAGGAACCAGGTAATACACAATTCCCACTGCCAACTG 

TCGGGTGTGCAGGCCTGAGCTTCCTGCATGTGGGAGGAAAGAGAAAGAAG 

AGAGAAACTCCAAGATCCAAGAGATCCAGCAAGAAGGCTGGAGTCTGAGG 

ACGCAGAAAGCTGAATGGCACAGTTACCACTATTG7GCTGAGGTTCTGTG 

GCC7C7GGG7C7CTTGACAACTGGGCAAAGACCCACAGAAAAC7A7CTC7 

AGACCC7ACC7GTGGGAGGGGAAAG7GC77AAGA7CA777ACAGGACAGC 

CACC7GGACC7CAAATGGCTTACAG77CC77CA7CCAGAGGG7C77CA77 

7AG7ACA7ACCAGGTGCTAAGC7GGG7GC7GGAGACA7GACGGGGAACCC 

AT77ACCA7GGCTTTG7TAC7G7GACA77CACA7C7AGGGAAAGCCAGCA 

AAGGGGAGGGA7CGAGGAGAGC77G77AGGCAGAGAAAA7ACCCAAGGGC 

AAGGGAGAAGCCAGCC7GTTCTGAGCACACACAG7GG77CCA7C7AAC7G 

GGCC7CAG7GCCAGG7TGGACTGGAGA7GGGGC7GAGGAGCTGTCACAGA 

GCA77C7GGACACAGA7GTCACA7AG7CCC77GAGG77AGGG7CC77AGG 

CA7GGCAGCA77GCTTTGAGTT777CC7777G7AA7G77GCCA77CA7GA 

CAA7G7GGAAGATGGG7CCTTGCAGAGAAGGGCAGGGC7G7GAGACCAG7 

7AGGAGAC7AAGATG7GAGCCAAGGAAAA7GAGGAACACC7GAACAC7GG 

GGCAGG7GCAGGGCCCAGAGAGAAGCAGA7GGC77CC7GAGG7777AAG7 

AGG7AGAA7CAAGGCAGC7GGTACAGA7C7777A77ACA7A7AAAC7GGA 

A7AAGCCA7C7G7TCCAAGACAAAAGAG7AGGCGGAAAACAA7ACAAGAC 

AG AAA7GGAA77AGAACAAAC C7GGGAGGAA7G7GGAA77AGAG7AG AG A 

G7CCAACAC7GGCTGCAA7CA7AAAAA7G7AAAACAAACAAAAA777GC7 

AGG7G7GC77ACTTAGAAATAAT7AGC7G7CA7A77AAG77CAC77GTG7 

TA7GGC77AAA7GTGTCCCCCAAAA7G7GA7G7G77GGAAAC77GA7CCC 

CAA7GCAACAGAGTTGAGAGATGGGACCTTTAAAAGG7GA77AGG7CA7A 

AGGG77C7GCCCTCA7AAA7GAA77AA7AC7G77A7CA7GAGAG7AGA77 

CC7GA7AAAAGGATGATCTCTGCC7CC7CCCCACAGCCC7CT7G7GCA7G 

C777CC7GCC77TCCACCTTCTGCTA7GGGA7GACACAGCAAGAAGGCCC 

TCACCAAA7GCAGCTCC77GA7C77GGAC777CCAGCC7CCAAAAC7G7A 

AGCCAAACAAA7TTCTGTTTAT7A7AAA77ACCCAG7C7CAGG7ATTCTG 

T7C7AGAAACACAAAA7GGACTAAGA7CA77AAA77A7CA777777A7CA 

GAC7G77GA 

>Concig27 

AAAA7A7AACAGAGAG7AAGAGGAAAA77ACC77C777C77777CC777C 
CC7GCC7GACC7TAT7CACC7CCCA7CCCAGAGCA7CCA777A77CCAT7 
GA7C777AC7GACATCTA7TATCTGACC7ACACAA7AC7AGACAT7AGGA 
CAA7G7GGCC7GCC7CCAAGAAAC7CAAA7AAGCCAAC7GAGA7CAGAGA 
GGA77AA7CACCTGCCAA7GGGCACAAAGCAACAAGC7GGGAGCCAAG7C 
CCAAAA7GGGGCCTGC7GCT7CCAG77CCCC7C7C7C7GCA77GA7G7CA 
GCA77A7CC77CGTCCCAGTCC7G7C7CCAC7ACCAC777CCCCCTCAAA 
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CACACACACACACAACAbJCTTAGATGTTTTCT 

AC7CAA777G7AAG7A7A7AA7CCAAGACC77C7A77CCCAAG7AGAAT7 
TATGTG CCTG C CTGTGCTTTT CTAC CTGGATCAAGTG ATG7CTACAGAGT 

AGGGCAGTAGCTTCATTCATGAACTCATTCAACAAGCATTATTCACTGAG 
AGCCTTGTATTTTTCAGGCATAGTGCCAACAGCAG7GTGGACAGTGGTGC 
ATCAAAGCCTCTAGTCTCATAGAACTTAGTCTTCTGGAGGATATGGAAAA 
CApACAACCCAAACAACCAACAAAAGAGCAAGATGCTGCAAAAAAAAAAA 
AAATGAATAGGGTGCTAAGATAGAGAAAAGTGGGAGAGTGCTATTTAGAC 
AAAGTGGTAAAAACAAAGCCCCTTGTGAGATGAGAGCTGCCGACAGGAGG 
GGGCGGGTCATGGTTGTGGGTTTTTGGGTAGGACATTCAGAGGAGGGGGC 
GGGTCGTGGTTGTGGGTTTTTGGGTAGGACATTCAGAGGAGGGGGCGGGT 
CGTGGTTGTGGGTTTTTGGGTAGGACATTCAGAGGAGGGGGCGGGTCGTG 
GTTGTGGGTTTTTGGGACATTCAAAAGAGTCTGAATGCACCCAGGCCTAC 
AACTTCAAGATGGTAAAGGACAGCTCCAAGGATCAGAAGAAGCATGCTTG 
GAACTGGGGCATTTTGAGAAGGAGGAAAAATATGCAGAGACTAGTGCTTG 
CAGAGCTTGCATGTGGATTTCATTTGAGGTACAATGAAAACCCATTAATG 
GGTTTCACACAGTGCAATGGCCTGACCTCACTTATATTTCCTAAAATAGA 
AAACAGATCAGAAGGAAGGCAATAGAGAAGCAGAAAGTCCAATGAGGAGG 
TTTCACAGCAGTCATGGGGGTGGGGTAAGGAAAAGAAGTGGAAAGAAACA 
GACAGAATTGGGTTATATTTTGGAGATAGAACCAACAGAAGGAAGAGGAG 
AAACAACAT7TACTGAGAAGGGAAAAAGTAGGAGAGGAATAGGTTTGGGA 
AATAAATCCTGCTGACATTGGAAACCCCAAGGAAGCCTCAAAAGTATATT 
TACTTGCTTTAGATTTAAAAGAATAGGAAAGAAGCATC7CAACTTGGAAT 
TTGAAATCTATTTTTCCATAAAAGTATTGTTAAAT7CTACTCATACTCAC 
AAGAAAAG7ACATTCTAAAGAGTATATTGAAAGAGTTTACTGATATACTT 
AGGAATTTTGTGTGTATGTGTGTGTGTGTATGCGTGTGTGTGTGTTTAAC 
CTTCAAT7GTTGACTTAAATACTGAGATAAATGTCATCTAAATGCTAAAT 
TGATTTCCCAAAGGTATGATTTGTTCACTTGGAGATCAAAATGTTTAGGG 
GGCTTAGAATCACTGTAGTGCTCAGATTTGATGCAAAATGTCTTAGGCCT 
ATGTTGAAGGCAGGACAGAAACAATGTTTCCCTCCTACCTGCCTGGATAC 
AGTAAGATACTAGTGTCACTGACAATCTTCATAACTAATTTAGATCTCTC 
TCCAATCAACTAAGGAAATCAACTCTTATTAATAGACTGGGCCACACATC 
TACTAGGCATGTAATAAATGCTTGCTGAATGAACAAATGAATGAAGAGCC 
TATAGCATCATGTTACAGCCATAGTCCTAAAGTGCTGTTTCTCATGAAGG 
. CCAAATGCTAAGGGATTGAGCTTCAGTCCTTTTTCTAACATCTTGTTCTC 
7AACAGAATTCTCTTCTTTTCTTCATAGGAGATGCCTGAGATACCCAAAA 
CCATCACAGGTAGTGAGACCAACCTCCTCTTCTTCTGGGAAAC7CACGGC 
AC7AAGAAC7A777CACA7CAG77GCCCA7CCAAAC77G777A77GCCAC 
AAAGCAAGAC7AC7GGG7G7GC77GGCAGGGGGGCCACCC7C7A7CACTG 
AC777CAGA7AC7GGAAAACCAGGCG7AGG7C7GGAG7C7CAC77G7C7C 
AC77G7GCAG7G77GACAG77CA7A7G7ACCA7G7ACA7GAAGAAGC7AA 
A7CC777AC7G77AGTCA7T7GC7GAGCA7G7AN7GAGCC77G7AA7TC7 
AAA7GAA7G777ACAC7CTT7G7AAGAG7GGAACCAACAC7AACA7ATAA 
7GT7G77A777AAAGAACACCCTATAT77TGCA7AGTACCAATCA77TTA 
A77A77A77C77CA7AACAA7777AGGAGGACCAGAGC7AC7GAC7A7GG 
CTACCAAAAAGAC7CTACCCATATTACAGATGGGCAAAT7AAGGCA7AAG 
AAAAC7AAGAAA7A7GCACAA7AGCAG77GAAACAAGAAGCCACAGACC7 
AGGA777CA7GA7T7CA77TCAACTGT77GCC77C7AC7TTTAAG77GC7 
GA7GAAC7C77AA7CAAA7AGCA7AAG777C7GGGACC7CAG7777A7CA 
T777CAAAA7GGAGGGAA7AA7ACC7AAGCC77CC7GCCGCAACAG7777 
7TA7GC7AA7CAGGGAGG7CA7T77GG7AAAA7AC77CTTGAAGCCGAGC 
C7CAAGA7GAAGGCAAAGCACGAAATG77A777777AA77A77A777ATA 
7ATGTA777ATAAA7A7ATT7AAGATAA77A7AA7A7ACTA7A77TATGG 
GAACCCC77CA7CC7C7GAG7G7GACCAGGCA7CC7CCACAA7AGCAGAC 
AGTG7777C7GGGA7AAG7AAG77TGAT7TCA77AA7ACAGGGCA777TG 
G7CCAAG77G7GC77A7CCCA7AGCCAGGAAAC7C7GCA77C7AG7ACTT 
GGGAGACC7G7AA7CA7A7AATAAA7G7ACA77AA77ACC77GAGCCAG7 
AA77GG7CCGA7C77TGAC7CT77TGCCA77AAAC77ACC7GGGCA7TC7 
7GT77CA77CAA77CCACC7GCAA7CAAG7CC7ACAAGC7AAAA77AGA7 
GAAC7CAAC777GACAACCA7GAGACCAC7G77A7CAAAAC777C77TTC 
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TGGAATGTAATCAATG1 . rCTTCTAGGTTCTAAAAATTGTGATCAGAGuA 

TAATGTTACATTATTATCAACAATAGTGATTGATAGAGTGTTATCAGTCA 

TAACTAAATAAAGCTTGCAACAAAATTCTCTGACACATAGTTATTCZATTG 

CCTTAATCATTATTTTACTGCATGGTAATTAGGGACAAATGGTAAATGTT 

TACATAAATAATTGTA TTTA GTGTTACTTTATAAAATCAAACCAAGATTT 

TATATTTTTTTCTCCTCTTTGTTAGCTGCCAGTATGCATAAATGGCATTA 

AGAATGATAATATTTCCGGGTTCACTTAAAGCTCACATTACACATACACA 

ASACATGTGTTCCCATCTTTATACAAACTCACACATACAGAGCTACATTA 

AAAACAACTAATAGGCCAGGCACGGTGGCTCAGACCTGTAATCCCAGCAC 

TTTGGGAGGCCAAGGTGGGAAGATCACTTGAGGTCAGGAGTTCAAGACCA 

GCCTAGGCAACATAGTGAGATCTCATCTCTACAAAAAAAAAATGAAAAAT 

TAAAAAATGAGCTGGACATGGTAGTACACACCTGTAGTCCCAGCTACTCG 

GGAGGCTTGAGGTGGGAGGATCACTTGAGCCTGGGAGATGGAGGCTGCAG 

TGAGCCATAATCACACCATTGCACCCCAACCTGGGCAACAGAGTGAGACC 

CAGTCTCAAAAGATAAATTTTTAAAAATGTTAAAAAATATATAAAAGAGA 

ATTTTAAAAGAACAACTAATAGATCAAAGCATGGATGCAAGATATATTTA 

GTTGGAAAATCAAGGTTAAAATCAAGGGATCTTGGAATTAGGTGTGGTAG 

ATTTGGGTAAGGAGTAGTCTAAGATGACCCTGTTTCTTGGTACTGGAGAC 

TGGATGAGTGGCAGCGTCTTAACCATATTTTTGGTAGAAATATGGAGGTC 

TTCTCCATTCCAGGATGAATGATGAGTAAAATTTTAGGCATGTAATTTGA 

GCTACTAGAAGGACACTCAATTGCAGATGTACAATGGGGAGATGATAACC 

TATCTGGAACTCAGAAAAATAACTGTATATAGATATGAAAGACATCAGTA 

GG7ATGTAGTAGATAAAATCCTAAAAGTGATGTCAAAGGGAGAAGAGAAG 

TATATGGTGAACACTGTTGTTTGTCCATGCAATTGCCATCTCTTCTTCTT 

CCTTACTGACAGAACCCTGATTTCACTGAGAAGTCAACATGCCCTTCCCC 

AATTGATGAATCCAATTGGTTGAAGATTATGTTCATTCTATTCTTACATG 

ACTAAGTCACGTTGACTTAATCCTATCAAATGAGATGTCGATCTGGAAAC 

AACTTCTGGAAAAGATTTTCTACCTTGATAAAATAAAGAGCCATATAGAT 

GGTCCTTTATCTTCCTTCTTCCTTGAATGAGATATGTTCTATGAGGAAGT 

GAAGCTTAGAACTGTGGTCAGCAACTTGCAACGACTGGGAAGTCAGAGCC 

ACACAATGAAGAATGCAGAGTGGAAGGAGAAAAAGAGCCAGCATCTCTGA 

CAACATTGTTACACCGAGAACCTACCTCCAGATTTTAAGAAAACAAGAAA 

TGCTACTGTTATTAAGCCATTTCACTGGGTTTGCTATGACTTGCAGTCAA 

ATCTAGCTTAACTGATACAGAGCACCACAGAGAACTGGTCTCTCATTTGT 

CTCATCCTGTTCTTTCTAGCAGCCACGACTTTCCTAGGGTTTCCTTAGCC 

CAAGTCTGGCTAGAGCAAGACTAAGTAAGACTTGATTCCTTAATGTCCTT 

TTGTTTTAAGAAATATTAAAGAATTATTTTTATATTAATATATTTTAAGA 

AATAAGGAAATACAAAACZACTGAGCAAGCAACACAAATTCAAGAAATC 

AAAAAGTATAATAGCTGCTCAGTC7CTGATTAACAGTGAAATATGGAATC 

ATTGTAGAAATGGCCTTOGAGCGTTATTCTCCCAGGCCAGCTATCCTTAT 

GGTCTGCCCCACCTCCCTCATTGCCTAAACAGTAAGAGAGTCCCATGGTG 

AGACTCAACAGTCTTAGCACAGAACTTGTTACAGTCTATTTCTTTTCTTA 

CAGTCCTATATATCAATTCCAAATCAATGAGAGTAAAGCCCAATCCCTGC 

CTTTAAACCCAAAGGACAGAAGCCCAAAGCCCAAAGATATTCCCTAACCT 
TCTCCCCCT 

>Contig28 

CCTGTCGCTCCCTATGTTTAAAGCTGGGGATCTCTTTTTCCTGTGTCTAA 
TTATTTTCCTCATTGGCTTGAAAAATCTGATAAAACATTTTAGGACTGTG 
TATAAAATAGAATTAGCCAAGTGCAATGTCTTTATTCAGAAGAAATTTCA 
TGGACGTTGTGCCTACTCTCTTGGCTTCCTGGCTTCATGGCTTTCCAGAT 
CCCACAGTAAGCTCTGGATAGTAGAAGTTATAGTAAGACTGACTTCTAAA 
TAAATGAAGTGACTTTAACCTTACTGATATGGCTTAAAGAAAAGGAGTGG 
CCTTTAAGATCCATGAACTTCTCAAACAAAAGTGATAACGTTATCTCCAT 
GCATATATAATACTAAATATAATGCAACTGAGAGAAGTAGGCTGTGGTAA 
GAAAGGAGACCCAAGTGCCATCTGAAGGCAGCACTTACCACTCTGCTTCA 
TCCCACCGAGGAAACAAAGCATGAGTATTGCCAGATTTTCTTCTGTTTCA 
AGAAAAGCCAGAAATCCAGGTTTTTGCGTGAAATGTCCTGATTTTAATGT 
TGGGAACTAATTTATATTTTGAAATAACATTGTGTGGGACAAGTGAACTT 
GTATGTGGAACTGCTTTCTCCCAGTGGCGACCAGTTTGGACCGTTGATAC 
TCAGCAAGTTCAGCCAAGTGCGCCTTGTCATTGTCAGTCATCAAGGTGAT 
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37G7GA77GG7CAAACAKT7AG7777GC7CAGCA7C7CG7G7GTTrrCAA' 

AGGACCTGAGGGTTCATTTGCCCATGCAGATCTTGTAGTCCTGTTTATTC 

TATTAATTTATCTTGCAAATCTATAATGTTTTATTTTAAGCAGCGAGAGC 

CGTGGCAGCCTTTGGTCTGGACCCTTTCTAATGATCATTTAGTATCAGGC 

TATGTGGGAGTTGATTGTTTTGCATTGCCTGAAAGCCAACAGTATCACTC 

CTCCTCTAGGTGTGGCAGAGATGTGAGAGAGGGAGACTGACAGTCTGTGG 

Gi^TGTATGCAGTGTTGGGGGAAGCGAGGCACAGGGGACAATACTGTGGT 

3TATAAAACTAGTCTAAGGTAGCATCAGGAAGTTCATGAAGCCAAAATGA 

TTTTCATAACAGCACAAGACATTATTTGTTTTTGCCTCCCTCTCATTTTT 

TTTTTTTTTTGAGACAGAGTCTTGCTCTGTCATCCATGCTCGTGTGCAGT 

GGTGCAATCTCGGCTCACTGCAACCTCCACCTCCAGGGTTCAAGCAATTC 

TCATGCCTCAGCCTCCTGAGTAGCTGATTACAGGTCTGCACCACCCCGCC 

GGCTAGTTTTTGTATTTTTAGTAGAGATGGGGTTTTGTAATGTTGGCCAG 

GCTGCCCTGTCATTTTTTTTTTACTAGTGTCCAGTGGAGTTTTTTAGGGG 

CTACATAACATGATACTGTCATTAATCTAATGGCTAATGAAAGGGATATG 

TATATGTTTTTGTGTTTAAAACAAACTTCTTTGGGGTCCTCAATAATTTT 

TAAGAGTATAAAGGGGTCCTGAGATCAAAGAGTTTGAGTTCTGCTGGACT 

GGGACAGTGGTTGTCAACCCAGATTGTACATTAGGGTCATCTGGGAAGCT 

TTAAAATAGTACTGATGCCCAACCTTACCGCAAACCAATTAAGCCAGAAT 

CTCTGTGGATGAGAAGTCTTCATTGTCATCATCACCATGACCATCATCAT 

TGTCACCGTCACTACACCATTATCATCATCATCATATCATCTTCATTATC 

ATTGTTAGTATCTCCATCACCATCATCAGCATCACCATTATTATCATCAT 

CATCATCCCCACCATCATCCTCATCGGAACTTCACC7GCATGGAGGACAA 

TCCACTATGCATTAGGTGCTATGCTATTTGCTATACTCCTTATTCTCACA 

ACTGCCCAGAGAGGCTGATATTATCTCACTTTATAACAGGAGGAATCTGG 

ATCGGAAAAGTTAAGGTAAGCTAATTCACAGAGCGAGAAGAGATAGAGCC 

AGGATTCGAAACCAGTTCTCTGCTACATCAATGTTCCCAGTCCTTGCACT 

ATTGAGAACCTCTTTAGTTATGCTTTCACCCCTCCAACACCACAGTAAAT 

TTTTTCTTTTTTTAAAAAAATTATACTTTAAGTTATAGGGTATATGTGCA 

TAATGTGCAGGTTTGTTACATATGTATACATGTGCCATGTTGGTGTGCTG 

CACTCATTAACTCGTCATTTACATTAGGTATATCTTCTAATGCTATCCCT 

CCCCGCTCTCCCCACCCCATGACAGGCCCTGGTGTGTGATGTTCCCCACC 

CTGTGTCCAAGTGTTCTCATTGTTCAGTTCCCACCTATGAGTGAGAACAT 

GTGG7GTTTGGTTTTCTGTCCTTGTGATAGTTTGCTCAGAATGATGGTTT 

CCAGCTTCATCCACGTCCCTACAAAGGATATGAACTCATCCTTTTTTATG 

GCTGCATAGTATTCCATGGTGTATGTGTGCCACATTTTCTTAATCCAGTC 

7ATCATTGCTGGACATTTGGGTTGGTTCCAAG7CTTTGC7ATTGTGAATA 

G7GCCACAG7GAACA77CA7G7GCA7G7G7C777A7AGCAGCA7GA777A 

7AA7CC777GGG7A7A7ACCCAG7AA7GGGA7GGC7GGG7CAAA7GG7A7 

77C7AG77C7AGA7CCTTGAGGAA77GCCACAC7G7C7ACCACAA7GG77 

GAA77AG777A7AGCCCCACCAACAG7G7AAAAGCA77CC7A777C7CCA 

CA7CC7C7CCAGCACC7GTTGT7TCG7GAC77777AGTGA77GCCA77C7 

AAC7GGCACCACAG7AAATT7T7A7AGA7777A7AAGCAAA77G7A777A 

C7G7GCAAGAA7TGGT7TATTTT77AAACCA7G7G77GCAAACA7ACAA7 

GG77AA77G7GA7ATT7GCTCAGTACAAGATCA7CAGATCAC7ACACAGA 

C77GAGG7AA77CCACC7AAAAGCAAAGAGAAC7GACCCCACA77AAC7G 

AGAAG7C777AC77AT7TATTCCC7A7AAACGAGCCAA7A7GAAGAGAAG 

GCC77AA7G7GG77AACTATGTAA7777777C7GAC77777GAAA7AC7G 

AGAAGAGC7CA7GACTCTCCCA7C7CC7AA77C7ACC77GG7GGA7T7TA 

GAC7GACCACAAC7CA7GGG7AAATGAGGGAAGACGAA7AAGAAACC77G 

C77777777CC7CC77GTT77TGGC7GGC7GCAG7GGC7CACACC7G7AA 

7C7CA7CAC777GGGAGGCCAAGG7GGGAAGA7CAC77GAGC7CAGGA77 

TCAAAAC7GGCC7GGGCAACA7AG7GAGACCCCA7C7C7AAAAAAAAAAA 

AAAAAAAAAAAAAGGCGACAGGCGG7GCG7GCC7G7AA7CC7ACC7AC7C 

AAGAAGCCGAGG7GGAAAGA7CAC77GAGCA7GGGAGG7CAAAGC7GCAG 

7GAACC77GA77GCACCACT7CA77CCAGCC7GGG7GACAAAGCAGGACG 

C7GCC7CAAGAAAACAAAAACAAAACC77AA777777GGCTA77C7777C 

7GG7AAGAA7GG7A7AGAGA7GGGGA7GAGGA7GGC7A77G7A7GAGAGA 

GCAAACAGGG7CCAAGCAG7GC7C7GGGC7G7C7AAGGACCAG7AG7CAG 

C77AAC77C7CAAA777CCAGGGAAGGAG77CGGAG7GG7AGAA7A7CC7 
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GGGTATGCCCAAAGCATLaCCTTGC x * 

T7CA777G77A7GAC7GGAAAC7GGC777G7G7A7GCCAGAGAA7GGGGG 

CAGGAAAGAGAGATTGGTGTCTTGAGCTCTCTGTGCCTCTGGGGCAGTGA 

TGCTTTTCCTCTCATGTGGAAGGAGAGCATGACTGAAAAGGTGCACAAAT 

AAGGTGTCTGTGAGAGAAATTAACCTTCCAGATACAGAGACACAACCTTC 

CCCAAGAGGTCCTCATTGCTCTGCCTTTTTTCCTTTTTTTTGCTTGTTCT 

ACCATTAATAACAGAAACTGATTATGACCTCAAAAGAGAGGAGAAAGCGA 

CTCTCCCCACCCTAGAGCTAGTTAACCACCATATCTTCCTAGATATCCTT 

GAGAGCAATGTAACCC 

>Concig29 

GTGAACTCGTTTTACCTGTGTAGCAGACCAAGCCGCAGACAAAATCCNTC 
AGACACCAAATTAAAGAAGGAAGGGCTTTATTGGGCCTGGAGCTGCGGCA 
AGACTCACGTCTCCAACAACCGAGCTCCCCGAGTGTGCAATTCCTGTCCC 
TTTTAAGGGCTCACAACTCTAAGGCGGTCCACATGAGAGAGTCGTGATAG 
ATTGAGCAAGCAGGGGGTATGTGACTGGGGGCTGCATGCACCTGTAGTTA 
GAATGGAACAGAACATGACAGGGATCTTCACAGTGCTTTTCTTATGCAAA 
TAACCGATTAGATCAGGGGTGGATCTTTACCAGGCCCAGGGTGTGTCACC 
GGGCTGTCTGCTTGTGGATTTCATTTCTGCCTTTTAGTTATTACTTCTTT 
CTTTGGAGGCAGAAATTGGGCATAAGACAATATGAGGGGTGGTCTCCTCT 
CTTACCTGCGGGGAGTGAGCTCAAACTCCTTAAAGGAGTTACCTGCCTTC 
CATCATCAGGGAAGCAGGAAATCTTGCCTTCCTTGTTGGAAGCAAGTAAA 
ACTCAAAACAAACAAAGAAAAAAACAGGGAGTTGTACAGCAAAATAAACT 
T77GA7T77GACCAAA7777GGGAGA7CAGGAA77C7C7GAAGGAGA7GC 
TTTCAGACCTCAGCAAATTGTCCTGT7GGTTTGAGCCATAAAGTTAGCTC 
ATGCTGGTACCAAACACCAGTAGGAGATTTGTCAAAGGTAAGAGGCATCT 
CCACTCAGAATCCCTTCGTGGTTACCAACATGTGAACCTTGGAAATCTGA 
GACAGG7CTCAGTTAATTTAGAAAGTTTATTTTGCCACGGTTGAGGACAC 
CCACCCATGACAGAGCATCAGGAGGTCCTGACCACATGTGCTCAGGGTGG 
TCTGAGCACAGCTTGGTTTTACACATTTTAGGGAGACATGAGACATCAGT 
GAATATATGTAAGATGTACACTGGTTCCCTCCAGAAAGGCAGAACAACTT 
GAAGCAGGGAGGGAGCTTCCAGGTCACAGGTAGGTGAGAGACAAACAATT 
GCATTCTTCTGAGTGTCTGATTAGCCTTTCCAAAGGAGGCAATCAGATAT 
GCATTTATCACAGTGAGCAGAGGGGTGACTTTGAATAGAATGGGAGGCAG 
GTTTGCCCTAAGCAGTTCCCAGCTTGACTTTTCCCTTTAGCTTAGTGATT 
TGGAGGCCCCAAGATTTATTTTCCTTCTACATCACTGTGGGCAGCTGACT 
AGGAAAGC777G7AGGAC7GG7GGGCAG7G7GAGAGCCCAG7GGGGGG7G 
G7GG7CC7G7GCCAA7GG7AGCAACCACC7G7GAGGC7GAG7AAAC7CA7 
T7CCCAACC7CC7C7AGCAGCCCCAG7GGAGA7ACAGAGGAAGCAGAC7A 
3CGA7ACAACCCAGCC7GAAG7777G7C7GG7GAG7G7AA7GGAA7AAAA 
A7GGGAAGGG7GC7GAAGAGACCAGCAAGAAAA7GG77GAAGAGA7GGGG 
CACAGAAA77AAGC7GGA7CAAAAAGGACGGAAAAGCAGAAAGGGCCGA7 
AGAGAGAGGGGA7A7C7A7GGG77CGCGAT7C7GAAAAGGACAAATCAC7 
GG7GC777GAGAAGAGAGAGGG7GAGAAAGCAGGAAGGC7GGAGGC7G7C 
A7CCAAGAGGCGGACA7C7GTGAACA7GAT7CCAAGAG7CACCAGACCA7 
GGGGG7GGCCAAAGGGAG7GCC7C77C7CACC7CC7ACTC77AA77CC77 
G7AC7CAAGA7AA7AAG77CCCAGAAGAGAAG7ACCCA7A777AA77CA7 
C7G7G7C77CC7AGCAG7AC7AAAAA7A7TA7A7GAAAGG7ATCAAACC7 
77GAGAA7G7G7GC7GC7AAA7TG77AAGGA7GC7GGAAAACTCAAGACG 
7CCC7GA7CC7GAGCC7GAGTATGAGCC7G7GG7GAGCCCAATGCAGG7C 
7CCA77CAGACAAAGGCC7CAGGGAACGGATGAGACC7AGGGACAGAGA7 
GCA7GC7GGAGCAGCA77CCCCA7CCC7AC7GCAGC7CAGGCCAGC7GAC 
TGC777A7GAG7AAACG77ACCAGGGAACAC777GCAG7C77AACACACA 
7GCCCACC7G7GACCAC7GA7CCC7G77GGG7GACCAC7GACA7CAGAGA 
77CGA7GGCAGCAA7GAAGACAAGGC7A7CC7CA77AGGAAGGAAAGGAA 
GGAGGAGGGAGGAGGGCAAACGAA7C777CC7GC77G7CAACCACG7CCA 
7C7C7G77AGG7GA777CCCA7G7G7GAC777G777A7C777A7AA7AAC 
7C7GAGAGG7AGG7C77GA7G7CCACA7777GAACA7GAGGACA7CCAGC 
CAGGAAG77GAG77C7GGGGACA7AGC7GAGAGGGCAAAGC7ACA7A7AA 
ACCCC7C777G777777C7GGC77A7CCAC7GAG7GCCCCC7GCAA7CCA 
CCAGCCCA777G7GAAG7GCA7AC7A7AGG7AAG77GGCACAGGAGGAG7 
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GGA7G7GGGCGA7777G ± CACAGC7C7CCAGGAAC77ACACAC7 GGTGAU J 

GAGGGCCAGG7A7G77CC7GACCAG7CACAA7CAAAGCAACC7CC7AC7A 

ATCAGGGAGGCTTGGTACCTGGGGAATGCTATGTTGAAAGGTTCTTTTCT 

GGGTTTTAAAATGATGGGTCTATTTCCTTATTCTTAAGATTGC T TTTT T T 

CTGGCTAGAACTTAAAAGAAATTTTCAGTAAAATTTCCCTTCCCTGGCAC 

AAAGTGAGCTTGAAATGAATTCCCAGGTGGCCTTGATACTTTAAAATATT 

GCCTCCTATAAAATCAACCTTTAGAAGAAGGAAGTCAAAGAACATGCTAG 

ATTTCACAAAGGTTAATTCCTTGAAATCCAGT7ATCTACAGGACAATGTT 

GTCAAAGAAAAAATTATTTGGCCAGGCACGGCGGCTCATGCCTATAATCC 

CAGCACTTTGGGAGGCTGAGGCAGGTGATCACCTGAGGTCAGGAGTTCGA 

GAC CAG CCTGGC CAACATGGTGAAAC C C CATCT CTACTAAAAATACAAAA 

AAAATTAGCCAGGTGTGGTGGTGGGCACCTGTAATCCCAGCTACACGGGA 

GGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGAGGAAGTTGCAGTGAG 

CCAAGTTCAAGCCACTGCACCCCAGCCTGGGCAACAGAGCAAGACTTTGT 

CTCCAAAAAAAAAAAAAATTCAATGATATTTTTAAATTCATGGTAAGGAA 

GATTTCATTCAGAACCAGCACAGAAGATATAGGAAACACTGCAATGGGAC 

TTTGCGGTGGGGGAGAGAGATTGAACACAACTACATATACAGCACGGGCA 

AGGACATATTCATAGCCAGGAAGCAGAGCAAAGATCAGTGGATGCGAAAT 

TACTAAGAGGAAACATGAAAAATAAGGGAGCTTCTGCCTAAACCCACCTA 

ACCGGATCCTTGCTGAAGACAGGACAGGGTGATTGGACACCACTTTGGGG 

ATGGTGGAGGATGGGGAATCCAGTGAGATTTCAAGGGTGATGCGATATTG 

AACATACAAAGTTCTTGCTAAAAAAGGATTTTACAAGAAAGTGTACAAAT 

G7GCCTGGGACAAGG7GCAGGAGCCCGACGGAGATGTGGTCCAGCAGAGA 

A7A7G7GCCGAGA7GA7AGG7GAG77C7C7GACGAAGGA7A7A7GC7GA7 

CCAGCCAGGG7GAAA7GC7CAGAGAAAGCACGGAGGGGC7A7G7CCG77G 

CCCCAG7C7CCACGCGG7CAAA7C7GA7CCCG77G7GAG7G7GGCCG777 

G7AGAAAGCAA7CAGGGGGGG7CCC7CCCC 

>Contig3 0 

AATA7A7A777777A7ANNA7N7GAGACAGGT7C7CAC7AGG77GCCCAG 
GCTGG7C77GAA77CC7GCCTTCAAG7GAC7C7CCCACC77AGCC7AC7G 
CATAGC7GGGAT7ACAGGCACAAACCAC7GCA7GCAGC7AAC777GC77C 
7CA77CCAGCAC77777A77CCAC7GA77A7A7G7A7A7G7A7A7C7GCA 
7CA7C7C7C7C7CTC7C7C7CTC7CTC7C7CTC7A7A7A7A7A7A7A7AT 
A7GGAAA7A7C7C7C7C7C7C7C7ATA7A7A7A7A7GGAAA7A7A7A7C7 
CAG7C7C7CC7A7CC7CC7TTAA7CAG7777GC7A7CC7G7CAA77CCCC 
CAACGAG7G7GA7G77G7GAAA7A7A7A777G77C77CA7C7CC7G777C 
C7GACA7ACAGC7777AAAAACCC77GGAA7C7C7GGAA7AA7AAGAG7G 
7C777TGCA7GC7AA7AGA7GAC7GC7GGC7GGCAGCCCCAA7GCAG7AG 
CT7CA7GA7GGGG777G7CACAGGAAAGACCAAGGCAGGA77GGAGAC7T 
GAGAC7G77AGCCCCAC7CCCCAACCAC7GGAGGGAG7GGAGGGGC7GAA 
GG77G7G7CAG7CACCAA7GGCCAA7GGT7CGG7CAA7CA7G7G7A7G7A 
A7AAAG C CAC7 C77AAAAAC C CAAAAAGG ACAGGG777GGAAGGGC7 CC C 
AGA7AGC7GGACACA7GAAGG77CCTGGAGGG7GG7GCCCCAGAGGGGCA 
7GGAAGC7CCACACCCC77C7CACA7GC777GC7C7GCGCA7C7C77CA7 
C7GG7G77CA7C7G7A7CC7TTG7AA7A7C7T77AGAA7AAAC7GG7AAA 
C77AAG7G7777CC7GAGT7C7GTGAGC7GCTC7AGCAAA77CACGGAAC 
CCGAGGGAAGCAAACCCAGA777ATAGCCA7CAG7CAGAAGCA7AGG7GA 
CAACC7ACCAC77GTAAC7GGCACCTGAAG7GGGAGGCAG7C77G7GAGA 
C7GAGCCC7CAACC7G7GGGA7C7AACGC7AAC7CCAGG7AGA7AG7G7T 
GGAG7GAA77AGGACACCCAACTGGTG7CGGCTGC7GGAGGAC7AG7GG7 
GGG AG AAA7 C C C CAAG CAT77CGG7GAC 7AG AGG7CACAGAAGAAC7CAG 

7G77GAGG7G77G7GACAG7A7GG7AGGGAAAAC7GCG7C7GG77777TC 
C7777ACAATCAG77AAA7A777AACACAAG7C7AC7G7A7A77AG7AAA 
AGGG77ACA77TT7TAA7G7C77GACAG77GCAC777GACAAC77CCA7A 
7CAA7CAC7777T77CG7G7CCG777GGAACCAAAA7CAC77GGGA7ACC 
A7GAACCAGGC7GCAGCG7A7TCCCCAGGCC77GAAAGC77GGAGGCCA7 
777GCCAGCCN7AA7CCC7G7GAA7ACCAGGC77CG7GGA777AAAAAAT 
AGAC77GAGGCCAGGCC7GG7GGC7CACACC7G7AAGCCCAGCAC777GG 
GAGGCAGAGGCGGA7AGA7CACAAGG77AGGAG77CGAGACCAGCG7GGC 
CAACA7GG7GAAACCCCG7C7C7AC7AAA7A7ACAAAAAAAAA77AGCCG 
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GGCGTGATGTTACACGw JAG7AG7GCCAGA7AC7CAGGAGGC7 GAGGL^G 
Z-AGAAATAC3rrGAACCTGGGAGGCAGAGGTTGAAATGAGTCAAGATCGTG 
CCACTGCACTCCAGCTTGGGCGACAGAGTGAGACTCAGTTTTCAGGGGAG 
TTAAAACAATACAAAAAAAGAAAAAGACTTGAACAATGAGGCTCCACTGG 
ATGGATTTAGGGGAATTACAGGAAGCAGGACCTGACGGTGCAATGCCACA 
CTCCACCTGTCCAGAATTGGACCTCACCAAGGGAGGTCTGTGGGGACAGG 
GAGAGGCCCTCTGCCTCCACCCCCTCCTC7ACTCCCCAAACCCTGAGTCA 
G5CTGAATGTAGTAAACCTGGAACAGAAAAGTTCAGTTTGGCAATAGGTA 
TCTGAAGGACTCCAGGTGCTTCTCCCTTGATTCAAAATTTTACTTATAAA 
AAAAATTATAAGAAAATTCTACTTAAAAGAAATAATCAGGGAGGTACAAC 
AAATTGTACTTTTTTTTTTTTTTTTTTTTTTTTGAAATGGAGTCTCACTG 
T7GCCCATGCTGGAGTACAGTAGTGTGATCTCGGCTCACTGCAACCTCCG 
CCTCCTAGGTTCAAGTGATTTTCCTACTTCAGCCTCCCAAGTAGCTGCGA 
TTACAGGTGTGTGCCACCACACCCGGCTAATTTTTGTATTTTTGGTAGAG 
ACGGGG7TTCACCATGTTAACCAAGATGGTCTCGAACTCCTGACCTCAGG 
7GACCCACCTGCCTCAGACTCCCAAAGTGTTGGGATTACAGGGGTGAGCC 
AC7AAGCCCAGCCA7TG7ACATA7777G7GGG7A777AC7AAAACA77A7 
7CAAAA7AG7AAAAAAAAA77GAAA7AAAC7GGGGAC7GG77AAA7AA77 
77GGG7ACAACCACATGA7GGAA7AC7A7ACAGCCA77AAAAA77ACA77 
GAGGCCAGG7G7GGTGGC7CATGC77G7AA7C77AGCAC777GGGAGGCC 
AAAG7GGGAGGA77GCT7GGACCCAGGAGC7CAAGACCAGC77GGGCAA7 
G7GGCAAAACCC7G7C7C7AAAAAAAAAA7ACAAAAAAAA77AAAAAGC7 
GGG7G7GGAGGCACACACC7C7AG7CCCAGC7AC7CAAAGGGC7AAGG7G 
GGAAGA7CAC77GAACCGGGGAGG7CAAGGC7GCAG7GACCCAAAA7CGG 
G7CA77GCAC7CCAGCC7GGGCAACAAAGCAAGACCC7G7C7C^AAAAAA 
AAAAAAATACA77GAAGAA7A7C77ACGG7A7GGA7AAA7A77CA7777A 
CAG7GA7AGA7GCAAA7AAAAGCAAA77ACAAAA7A7ACAG777AA77CC 
AAC777GA7AC7ACATATG7A7A7A7GAA7ACA7GCA7A7G77A7G7A7G 
7ATATG7AAA7ATAACAATATA7G77C7A7A7A7GGATA7TATA7AT77A 
CACA7ACATACACACATA7ATAATA7C77C7C7AGAGAGCAGAAAGAGAG 
7AGACAGA7AA7GAAGA7AGGATACAAC7CCAGTCCAGC7CAACC7AGGG 
GAC77GT777AAAGCCTCAGGAGAGAGAAG77GGGAC7AGAAAGCAAGGC 
AGC7A777G7AAGCA7C777G7G777CA7GC7A77GGGG7GGGAAACAAC 
AGCACAAC7777GAAAGCCCC777C7AC7CACCCCACAAAC7GCAGAGCA 
GC777AGGACCCTCAGAGTTCAAGAAGACCA777GCAGAG7AGAAGAAG7 
AAAAACA7G7A7GAACT7GACCC7GAGC7CA7GGAC7G7GCCA7GAGGGA 
AA77CC7AAAACAGCAGGAGAGGCCC7GGAGGAAGGCAGAGGCCC7GCA7 
CAGCAAG7CCAGGCAAAAGCC7GCA77CCA7AGA7GCTCATC7C7C7GGC 
7GG7GAGG7C7AAAGACGTT7GG7C7CAA7A77AAG7C7CG7GAGAGAGG 
TCACAAACCCAG7CCCTTGGCCACAAAAGGAAA7AAA77C7GGC77GAGA 
CA77AGGGAGGAACAGGGCAAGGGGAGGT7CAAGAAAG777TAA7GGA7G 
AGA7GA7A777AAGCAAGGCCC7GGAAAA7GAGAA777CAACCAA7AGCC 
A7A7GG7AGG7CAGAAAGCAAAGA7AAGGAGGGGGCAAG7GCAAGGGGCA 
ACA7CAGA7A7GACCAGGG7GTCG7GGGGCA7GGC7GA7GGAGAAGAAGA 
77AGAC7GGAG777GGGAA7GCCACAG7A7CGAGG77GGA777AA7CC7A 
7GGG7AA7AAAGCCAAC7G77CAACCCCCAACCCAC77GCAA7A7GGC7C 
CAAAATAGCAGG7GTTTGATAAAA7GAC7AC77TTACTC7AC7A77CCC7 
CCC7C77AAGAAGAAAAAGAAAG7GGAGGC7CAGAGAAAGGCAG7GGC77 
G7CCCAA7CACAC7A7GA777GGCCACAAAACAAGAACGAAA7G77ACAC 
CCAAAAA7GC7GCCTCCACCTCCC77CCT7GC77TCCTCCCTGCTGGAC7 
ACAGAC7A7C7CAAGAGTGACGTACACCA7CAGGGCT7CAGC7777CCCC 
GAAACAA7GCCAAAATATTAGCCA7ACG7CAC7G7AG7AAGAGCCC7GAA 
77GGGAA7CCCAGCTTTGACGCAGACA7GC7GA77GAC7C7G7GACCAT7 
C7C77CAC7TC7CCACTCTATTCT7CCCCACC7G7AAAG7GAGG7CC777 
CCAG7TA7AAAAACAGA7GATGC7A77G7CC7G777TGTATC7AA7C77G 
C7G7G77A7AAAAAAAAAA7AAGGC7C7G7ACA77CA7C77GGCCAA77C 
CC77CT7A7C7C7ACTTCCCACAGCCCC77777CTACAGAAAACCAGCA7 
7G77C77C7GGA7CCATCTCT7AAGAAAGCGC777GCCTCCCCGG77A77 
7AGG7GA7AAGAAGTGTCCTAGA7GACAGCCC7GGAATGGGC7GGAGGCA 
ACAAAAAAGCAAG7GAAA7AGACAG77ACAGCGACGACAA7AA7AACAAC 
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CAACACCTTCTCACTAAAGAGAAAGAAATAAAAAAGAAAATTAA^TCTGC 

CGCAATGCCCACACAGTCATTGAATAACTGCATGTGTACAGCACTTGGTT 

ACTTTTACATACTTCATATTTTAGCCTTCATAGCAGCTCACAGGGGTGGA 

TTTAATTTTTAGTCCAACTCCTGTCACGGTGCCTGGCACAAGTATAATAA 

ATGTTCTGTGAATAAATGACCCTCTTTTTAGATGAGGAAATCGAGGCTCA 

AGGAGAACAAGCAATGTAATGTCCCCCTCCTGTTCAGCCATCTGCCTTTC 

ACGCCACTGAATGCAGTAGTCCTCAGTGCCCTGAACTTGACCCtCTTCTG 

CTTTTCGGACTGGTCCTTCTAATCCCGTTGTGACTCACTACACCACCTCT 

CCTGCATATGACATCTACATTTTAAAACAAACCGTATGGAAATAACACAT 

TAGTCGGCTTG7TCCCCCACCCCCGCAAAAAAAAAGGCCTCTTTATAACA 

GAAACTTCTCAGGCTGGTAGGGGAATTTTATTCCCCCATTTATGGTAGAA 

AGGCCC7AACCTTGGACCTCACGCCATAGCTATTCACATGGGGGAATGAT 

GAATAACATGGGGAGCAGCATGTAAATATCATTGAGCCGTAGTCCAGACC 
TATAACACATC 

>Concig31 

GGGGGAGCTGCATGTGCCTGTCGAGATCTGGGGGAGGAACASGAAGATCA 
AGAGTTCTGTGTAGGACATGTTAAGTTGAAGGTGCTTACAGGATAGCCAG 
ATGAAGCATCAGGTGTGCAGTCAAAGATATGAGTCTGGAGCAGCACATCC 
TAAGTCACCTCCTGCACCAACACAGAACTTCCAGGCCACTCACTTGAGCT 
CTCCCAAATAGTTTCCAAGTGTCATTATGTTAATAACCTATGAGCTTGAA 
CACCAGATTCAAACCCCACTGCATGGCTTTTAAAGACCATCTCAA'GGGCT 
TGACACTCCAGGGAGCCAACTAAAGATGCCTGGTCCTACCATCAACCTCC 
AC C C C ATTTTTTATAGAAAATGTTTCTAC CTGTCCTAAGGCAGGGTCCTG 

CCCCACTCCCAGGCCCCTTTAGATCCCCAATATTCCTCCTCCCTGAACCA 
AAACCCTCATCATCTTCCAGCATGGGTGGGGCCTCCATTCTTGCTTCTGC 
TCCCCTGAGCAGAAGCAAGTTTCTCCCAACTTGACCTGATTCTCCTCCTA 
AGTACCAGTCACTGCTTTGTTTCTGGAATGAGAGAAAAAGACAGAGTGAG 
AGAGACAATCCAGAACTCTTGCTCACTCACAGCTAGGCTGGGCATCTGGG 
AGGATGGCTGTGTCCATGGGAACCTGGGAAAAGCCACACCCTTGGCACCC 
TGGTCACCCACCTGTCTCCCTGGCAGATTCCGCACTGCTCTCTTGCACCC 
TCTACCAGGGCTAACCGGCCTGCTCACTCTCCCCAGCATGTCTTCCCACG 
CCCACTCTCTAATTATTACATTCCCTTCACATAAACTGCCCTTCTCTCCC 
AATCACCACATGTTCACTTCCCACCCAGCTGTCAAAGTCTGGCTCAACCT 
CATTCTTGAAAAGGAAAAAACAAACAAACAAACAAACAAACAAGCAAAAA 
ACCTATGATGGATTAAGAACACACTTCATTCCAGGAACATGCTTATCTCC 
TCTAACTCTCACAACAACTACAGCAGGTAGGTGTTATCACACCCATCTCT 
CAGGTGAGAAAACAGGCTCAACGAGTGCAGGAGGACACAGCAAGTCAGTG 
ACAAAGCTTAAATTCAAGCCCAAGCCTGTTGGCAACCAACGTCTGTACCC 
TTGATAGCTACCTCATTTACCACCAAATCCAGTGGCCTCAGGCCTGGCTG 
CACACTGGGATCACCTGGTGCCCAGACCACATCTTAGACCAGTCATACAG 
AATCTCTTGGGCTGGGATCCTCCACGGTACATTTTAAGGGTCCCCAGGTG 
AGTTCCACCATGGACCCAGAATTGAGGACCCAATACCGTATACCATCTCC 
TTCTTCATCTCTTCTAAGGCATCTCTTACTCGCTGTGCACTCCCATACCA 
CTTTGTTCAATCATCCAATCATTCATTCATTGAGTCAGTTAGTCAGGAGC 
TACTCACTAGTCCCCTGCCAGGTCCTAGTCATGACATAGGGCTCTGGGGA 
CCAACAAGAAGCAGGACCCATGCCTCCTGCTCTCATGGAGCTTGCTCTGC 
AGCAGAGGAAGCAGTCAGTGAGATGTAGCAAATGTGAAATGTGCACAGAT 
GGGAAAAGCAAAACTTTAAAACTTTTAGGACAAAATACACAAGAAATCTT 
TGCAACTTTGGGACAGGAAGGAACAACATTCCTTACACATGACACCAAAG 
GAATCAACCATAAATAAAAAGGTGATCAATTTGACCTCATTTAAGTGTTA 
AGCTTTTTTCATTGAGAGACACCATTAAAAATTAAAAATACATGCCACAA 
ACTGGGATACAATATTTACAACACTTATGTCTCACAAAGGATTAGTTTTC 
AGAATATATAAAGAACTCCCGGCCGGGTATGGCCGCGCACGCTGGAATCT 
CAGCACTTTGGGAGGCCAGCGGATCACATGAGGTCAGGAGTTCAAGACCA 
GCCTGGCCAACATGGCAAAACTCCGTCTCTACTAAAAATACAAAAATTAG 
CCAGGCATGGTGGCGGGCGCCTGTAATCCCAGCTACTCAGGAAACTGAGG 
CAGGAGAATCACTTGAGCCCAGAAAACAGAAGTTGCAGTGAGCTGAGCTC 
ACATCACTGTAAGCCTCGGTGACAGAGTAAGACTGTCAAAAAAACGAAAA 
CAAAAACAAAAACTCCTACAAATAAATAAGAAAAAAATAGCCCAGCAGGA 
AAAAGTATATACATTTCATAAAAGAATAAATACATTCTGTCAGTTTTCTA 
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ACATATATTTTTTAAGAGTAAATACAAATC 

ATGCCCAACCTCATTAAAAATTATAGAAGTGAAAATTAAGCCACAATAAG 
A7ACGA7777A7ACCAAA7ACAG7G7CAACAC777GCAAG7C7GACC7CA 
CCAAG7G77ACCAGACGTG7GCACTGACG7GGC7GC7GAGATAC7GA7GG 
TGGG7C7G7AAA7C7G7AC7ACAAACAA77GCAA7AAAA7G7AA7AAA7A 
TACAATAGGTGGAGCAGGAAGTGACCTGCAACCATATAGCAGATAGGGCA 
GGAAAAAGCCTATGAAAGCTGACATCAAAGGGATAAGTTCCAGTTACCCA 
GCTGAAGGGAAGGAGGGTGTTTCAGATAGAGGAAGGATAAGCATGACCTA 
TTCAAGGCCAGTGAAAGAAGCGTGCAACGGCCAAGTCAGGAGAACCTGAA 
ATTGTGTCAAAGAGCTTGGATGCAAAGAGCCGTGGGAGACTATTGGGGGT 
TTTAAGCAGGGATATAATATTCATTCAAGCATGCAGTAAAAGG7CACTGG 
CACCTGCCATGGGCCAGGACTCGGGCTCTACATGATTGCGTCTGTTTTGG 
AAATATCACCCTGGCTGTGAGATGAAGAACAGGTAGGAGGGTCACAAAAC 
TTGAAGCAGAGAGACTGTTGAGGAAGTAAGCTGTTTTTGTGTGGACTGTG 
GCAATCACAGAGGCAGAGGATATAAATGCACAGAGACACAAGGCATGTGG 
GAGGCAGAAGGAATCAAATACAATGAGTGATCAGATGTGGGGTTAGAGTG 
GTGAGTGAGAAGACATACTCAAGGTGACACGCCCAGGTATCTGGGTGGAT 
GGTAAGACATTCATGGACTAGGATCGAGGAANGAGGTGGGGAATGGGACC 
ATACCTGCAGTTTATAAGGGGTGGACGAGGGAAGATTATGCGGGAGACTG 
AGAGAGGAATAGACAAAGGAATCCCGGTGCAGTATTACAGAAACTGGGGT 
GGGAGGGGGTTGTANTTCAAAAAGGAAAGAAAATTGTCAAATAGTATGAA 
ATGCTGCAGAGAAACTCACGGATTTTTTTTTTAAGCTTAGAATTATTCAT 
TGACTATGTGAATAAGAATAACTTTTATGAAAGAAGTTTTGCTTAAGTAG 
TAGGAAGAAGCAAAATTGTTGAGGGCTGATGAGTGGGAGGAGAAGTAATT 
GAAGG C ACT CTTTCAAGAGAAACAAAGCAG AAGG7GAGGAGAATACTAAT 
GAAGGAGTTACGGCCTTCACTATTTTGTTTTGCTTTAGATAAGCAAGACT 
TGAGTGGGTCTGGTGAGGAGAAACAAGTAGAGTACAAAGTTAAAGGAGAG 
ACAGACAGAGATAGAGATAGGGACAGAGAGAGAGACAGAGACAGAGCACA 
AAAGAGCAAGGTCCCTGAGAACACGGGCCTTCTGTTTAAACCCCAGCCAG 
ATGTATTGCAATTCAATTCCAGTACTAACCACCCAGAGTTTGTGTAGACT 
CTACAAGTTAAAGAGCATGGTCCCCAACAAGACTGCTTCTACGTCAGATG 
CCAGGCACACTTCAGGGGTCCCCAAGCCACTCATGTTTTTTGAATGACTG 
CCATAAGTTCAAAAATTCCCACAATTCTCTCAGATTCAATAACTGGGTAT 
AACCAC7CATAGAACTCAAGAAAATGCTATCATTATTATTACAATTTTAT 
TATAAAGGATACAAATCAGAAGGACTAGCCAAATGAGGAGACACATAGAG 
AGAGGACTAGTAAAAAACAGAGCTTCTGCGTCCTACCTTCAAGGAATCAG 
GATGCACCACCC7CCCAGCACATCAAGTGCTCATCAACCAGGAAGTTCCT 
CTGAGC7CCAATGTCCAGAGATTTTAGGGAGGATTCATTACATAGGTATC 
A77GA77AAA7CA77GGCCA7G7AC77GAAC7CAA7C7CCAG7G7CCC7C 
77C7CCC7AGAGG7C7GAAGGG7TGGC7AA7A7CA7G7GGC7CAAAGCCC 
CAAC7C7AA77ACC77777GG7C7777CAGGGAC7AGACCCCA7CC7GAA 
GC7A7C7ACAGGCCCTGCCATGAGTTAGCTCATTAACATAACAAAGACAC 
T7A7AT7AC7CAGAAAA77CCAACAG7777AGAAGC7CCA7G7CAGGAAC 
C7GGGACA7AGA7CAAATTCrTTTTT7 7 TrrmTTTT7GGAGACAGGGT 
C7TGC7G7G77GCCCAGGCTAGAGTGCAAGGACAGATCACAGCTCAATGC 
AGC77CAAC77CCCAGGC77AAG7GACC777CCACC77AACC7TCCAAG7 
A7C7GGGACCACAGAAAATGGCTAAT7ATCC7GGC7GATTT7TAAAC777 
77777777G7AGGGATGGGATCGCCC7G7G77GCCAAGG77GG7C7CAAA 
C7CC7GGG77CAAGCAATCATTC7GCCCTGGCCTCTGTGATGGTTAATAC 
7GAG7G7CAAC77GATTGGATTGAAGGATACAAAGTATTAT77TTGGG7G 
7G7C7G7GAGGG7G77GCCAAAGGAGA77ACA777GAG7CAG7GGAC7GG 
GAAAG7CCACCCT7TCCCAG7GGAC7GGGAGACCCACCC7CAA7CCAGG7 
AAACACAA7C7AA7CAGCTGCCAG7G7GGTCAGAATAAAAGGAGGCAGAA 
GAACAGGGAAACACTAGACTGGCTTAG7CTTCCAGCCTACATC7TTC7C7 
CA7GC7GAA7GC77CC7ACCCTCGAACA7CAGCC7CCAAGT7CT7CAG77 
777GGAC7C77GGACC77CAACCAGAGA77GAAGACTGCAG7G77GGC77 
CCC7G77777GAGGT77TGGGAC7CAGACTGGC7TCCT7GC7CCTCAGC7 
7GCAGA7GGCCAA77G7GGGACT77AAC77G7GA7CA7G7GAG7CAA7A7 
7CC77AA7AAAC7CAGA7ATA7A7ATA7GTA7CAGACATA7A7ATA7A7C 
C7A77G7A7A77A7A7ACAGATA7A7AA7A7CC7A77A7A7ACAGA7A7A 
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taatatcctattatath^ggtatatatatatatatgtatcXtatataxX^ 

TATCCTATTGGTTCTATCCCTCTTGAGAATCCTGACTAATACAGCCTCCC 

AAAA7GCTGAGATTACAGGAGTGAGCCACAGCCACCATGCCCAGCCCCAA 

A77C77AA7TA7ACAACAA7GGG7CCAGAGA7CAGGGCC7GGG7AGGA7G 

CAGCAATAAGAAAACAGATGGTGGATGGGGACACATGTTGGAAGTGTGGC 

AGGACATGGCTGAGGGAACTCATAGGATGGTGTCTATTTTCATGGCTGAG 

TGTGAGGAACAGC^TAAGGTO^AAATTTCAGGTCAATGGTGAGTTTTTTA 

AATTGTTGCTGTGAACCCCAAAAATCTGACCCAGGTCTCAGTTAATTTAG 

AAAG7CTATTTTTCCAAGGTTGAGAACACCCACCCACTCACGACAAGAGC 

ATCAGGAGGTCCTGACCACATGTGCCCAAGGTGGTAAGAGCACAGCTTGG 

TTTTATATATTTTAGGGAGACGTAAGTCATCAATCAATATATGTAAGATG 

TACACTGGTTCTGCCTAGAAAGGCAGGACAACTTGAAGCAGGGAGGGGGC 

TTCCATG7CACAGGTAGGTGAGAGACAAACAGTTGCATTCTTTGAGTTTC 

TGATTATCCTTTCCAAAGGAGGCAATCAGATG7GCAATTATCTCAGTGAG 

CAGAGGGATGACTTTGAATAGAAAGACAGGCAGGTTTGCCCTAAGAAGTT 

CCCAGCTTGACTTTTTCCTTTAGCTTTGTGATTTGGAGGCGCCAAGATTT 

ATTTTCCTTTCACATTTCCCCCCCTTTCTTTTTAAGAATCTTTTAAAGAA 

AGCTTTTAAAAAGAAAATGAGTCTCTGGTCCCAGGTTTCATCTGAATTCT 

CGAGGGGAGGATGG7TTATCCTAAACGGG7GG7TCTGAATTTTGAGAAAG 
7GCA77G7AC 

>Concig32 

AAAAGCCA7ACGAA7GAGGAAGAA77AAGGGCCAGAACAAAACAAGAAGA 
7GAGGGAAAG777GGAAC77C77AGAGAC7GGC7AAA7GG77G7GACCAA 
AA7GC7GA7AG7GA7ACGGACAA7GAAG7CCAGGG7GACAAAG7C7CAGA 
7GGAAA7GGGGAA777G77GGGAAC7GGGCAAAGG7CACCC77GC7A7GA 
C7CAGCAAAGAAA77GGG7GCA77G7G77CA7G7CC7GGGGA7C7G7GGA 
AGT77GAA7G7AAGAG7GA7GAC77ACGG7AGGG7A7C7AG7GGAAGAAA 
CCTC7AAGCAACAAAG7G7G77GC77AGAAA777C777C777C7777777 
7TT777777GAGC7GGAG77T7GC7G7G7CGCCCAGGC7GGAGCGCAG7G 
GCGCAA7C77GGC7CAC77CAAGC7C7G7C7CC7GGG77CA7GCCA77C7 
CC7GCC7CAGCC7CCCAAG7AGC7GGGAC7ACAGGCGCC7GCCACCA7AC 
C7GGC7AATT7777AG7A7777AG7AGAGACGAGG777CACCA7G77AGC 
CAAGA7GG7C7CAA7C77C7GACC7CGTGA7CCACCCGCC77GGCC7CCC 
AAAA7GC7GGGG77ACAAGCA7GAGCCACCCCGCC7GGCC7GC77AGAAA 
777C7AAGCCAGGA7A7GGCC7G7C7GC77C7AACAGCC7G7GC7CAGGG 
G7AAAGAAA7GAC77AAAG77GGAACC7A7G777AAAA7GGAAG7AGAG7 
C7AAAAA777GGAAAA777GCAGCC7GGCC77G7GGCAGAGAAAGAA7CC 
AAG7AGGC7GCAGAGCAA7CA77GC7AGAGAGA77AGCA7GAC7AAAAGG 
GAGCCAAG7GC7AA7A77CAAGACAA7G77AAAAAGGCC77GAGGGCA77 
7CAGAGA7C7A7GAAGCAGCCCC7CCCA7CACAGG7GCAGAGG777GG7G 
CACTAGGCCCAGAGGTTTTATGGGCCSNNGCCAGGGCCACACTGCTATGC 
ACAGC777GGGACAC7GC7GCCCGCA7CCAGGCCAC7C7GC7C7GGC7CC 
ACCC77GGC7CAAACGGGCCAAGATAGAGC77GGACCAC7GC7CCCGAGG 
GCACAAGCCA7AAGCC77GGTGG777CCA7G7GG7G77AAGCC7GCAGG7 
GCCCAGAA7GCAAGAT7GAGGGAGC77GGGCACT7CCACC7AAA777CAG 
AGGA7G7G7CAGAAACCC7AGG77CCCAGGCAGAAGCA7GA7ACAGGGGC 
AGAG C C C77GCAG AGAAC C7C7AC7AGGG CAA7G C CAAAGG AAAA7G7GG 

GG77GGAGTCC7CACACA7GG7CCCCAC7GGGGCAC7ACC7GG7GA7AC7 
G7GGGAA7GGGGC7GC7GCCCTCCAGACCCCAGAA7GG7AGA7GCAC7GG 
CAGC7GGCACCC7GAGCC7GGAAAAGC7GCAGGCAC7CAAC7CCAACCCA 
7GAGA7CAGCCACA7GGGC7AC7CCCAGGGAAGCCCACAGAGGCAGGGC7 
G7CTAAGGCC77GGGAGCC7ACCCC77GAACCAGC77GCAGGACA7GGAA 
TCAAAGA77A7G77GCAGC777AAGGC77AA7G7777CCC7G7CAA777C 
AGGC77G7G7GGGACC7G77GC77777Tl T r iT T Tr i T 7 77777 7 7 7GG7 
CACAGG7G777GAAC CAGAACAA77C CA7C77GAA7AGGGGC7GGG7AAA 
A7AAGGC7GAGACC7AC7GAGC7GCA77CC7AGGAGG77AGGAA77C7AA 
G7CACAGGAGGAGA7AGGAGG7CGGCACAAGA7ACAGG7AGCGAAGACC7 
CGC7GA7AAAA7AAG77GCAG7AAAGAAGCCAGCCAAAAC7CACAAAGCC 
AAAA7GG7GA7A7GG777GGC7C7A7G7CCCCACCCAAA7C7CA7C7CAA 
A77A7AA77CCCA7AA7CCCCACA7G77GAGGGGAGGACC7GG77GGAGG 
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TGATTGGATTATGGAGGCAATTTCCCCCATGCTGTTCTGGTGATACTbAG 

7GAG77C7CA7AAGA7C7AA7GG7777A7AAG7G777GGAAG77CC7CC7 

ACACACA7GC7CACAC7C7CTCC7GCAGC777A7GAAGAAGG7AC77GC7 

77CC777C7GCCA7GA77G7AAG777CC7GAGGC77CCCAGC7A7GCAGA 

ACTGTGAGTCAATTAAACCCGTTTTCTTTATACATTACCAGTCTTGGGCA 

GTTCTTTACAGCAGTGTGAGAACTGCTGGCGATGAGAGTGACCTCTGGTT 

GXCCTCACTGCTCATTATATGCTAATTATAATGTATTAGCATGCCAAAAG 

ACACTCCCACCATGACCCCAACAGTCATGCCTGTGCCGGTCTCAGCACCA 

TGACAGTTTACAGATGGCATAGCAACGTCTAAAAGGTACCCCATATGGAC 

TAACAAGGGGAGGAACCCTCAGCTCTGGGAAGTGCCTACCTCGTTCCCAG 

AAAGCTTGTGAATAATCCACTGCTTGTTTAACATATAATTAAGAAATAAC 

TATTAAGCATCCTTAGTTCAGCAGCCCAAGCTGCTGTTCTGCCTATGGAG 

TAGCCATTCTTTATTCCGTTACTTTCTTAATAAAATTGCTTTTACTTTAC 

TGTATGTACTCGCCTGGAATTCTTTCTTGTACGAGGTCCAGAGCCCTCTC 

TTGGGTCTGGATCGGGACCCCTTTCTGGTAACATTTTGACCAATTTCTCC 

CTTCTGGAATGGGAATGTTTACACAATGACTGTATCACTTTTGAATCTTG 

GAAGTAAATAATTTGTTTTTGACTTTACAGCCTCATAGGTGGAAGGAACT 

TGACTTGAATTTCAGATGAGACTTTGGACTTTGGGACTTTTGGGTTGGGG 

CTGGAATGAGTTAAAAGTTGGGGGGATTATTGGGAAGGCACGATTTTATT 

TTGCAATATGAGAAGCACATGAGATTTGGGGGACCAAGGGTGGAATAATA 

7GG777GGA7G77TGCCCCCTCCAAA7C7CACA77GAAATG7AA7CCCCA 

GTGTTGAAGTGAGGCCTGCTGGAAAATG7TTGGATTACAAGGCTGTCGAG 

CACATTGGATAAGACG7GTAGGNCCC 

>Concig33 

CGCAGCTCGCTGGTTAATTCTGTGGCTCCTGTGACCACTATTATAGCACC 
AGG7C7A7GACCAGGAGAA7TAGAC7GGCA77AAA7CAGAATAAGAGA77 
TTGCACCTGCAATAGACCTTATGACACCTAACCAACCCCATTATTTACAA 
TTAAACAGGAACAGAGGGAATACTTTATCCAACTCACACAAGCTGCTTTC 
CTCCCAGATCCATGCTTTTTTGCGTTTATTATTTTTTAGAGATGGGGGCT 
TCACTATGTTGCCCACACTGGACTAAAACTCTGGGCCTCAAGTGATTGTC 
CTGCCTCAGCCTCCTGAATAGCTGGGACTACAGGGGCATGCCATCACACC 
TAGTTCATTTCCTCTATTTAAAATATACATGGCTTAAACTCCAACTGGGA 
ACCCAAAACATTCATTTGCTAAGAGTCTGGTGTTCTACCACCTGAACTAG 
GCTGGCCACAGGAATTATAAAAGCTGAGAAATTCTTTAATAATAGTAACC 
AGGCAACACCATTGAAGGCTCATATGTAAAAATCCATGCCTTCCTTTCTC 
CCAATCTCCATTCCCAAACTTAGCCACTGGCTTCTGGCTGAGGCCTTACG 
CATACC7CCCGGGGCTTGCACACACCTTCTTCTACAGAAGACACACCTTG 
GGCA7A7CC7ACAGAAGACCAGGC77C7C7C7GG7CC77GG7AGAGGGC7 
AC777AC7G7AACAGGGCCAGGGTGGAGAA77C7C7CC7GAAGC7CCA7C 
CCC7C7A7AGGAAA7G7G77GACAA7A77CAGAAGAG7AGGAGGA7CAAG 
AC77C777G7GCTCAAATACCAC7G7TCTCTTCTC7ACCCTGCCCTAACC 
AGGAGC77G7CACCCCAAACTCTGAGGTGATT7ATGCCTTAATCAAGCAA 
AC77CCC7C77CAGAAAAGA7GGC7CA7777CCC7CAAAAG77GCCAGGA 
GC7GCCAAG7AT7CTGCCAATTCACCCTGGAGCACAATCAACAAATTCAG 
CCAGAACACAACTACAGCTACTATTAGAACTA7TATTATTAATAAATTCC 
7CTCCAAA7C7AGCCCCTTGACTTCGGATTTCACGATTTCTCCCTTCC7C 
C7AGAAAC77GA7AAGT7TCCCGCGC77CCC777TTC7AAGACTACATG7 
T7G7CA7C77A7AAAGCAAAGGGG7GAA7AAA7GAACCAAA7CAATAAC7 
7CTGGAA7A7C7GCAAACAACAATAATATCAGCTA7GCCATCTTTCACTA 
77T7AGCCAG7A7CGAG7TGAATGAACATAGAAAAA7ACAAAACTGAA77 
C77CCC7G7AAA77CCCCGTTTTGACGACGCACTTG7AGCCACG7AGCCA 
CGCC7AC77AAGACAA77ACAAAAGGCGAAGAAGAC7GAC7CAGGC77AA 
GCTGCCAGCCAGAGAGGGAGTCATTTCAT7GGCGTTTGAGTCAGCAAAGG 
7ATTG7CC7CACATC7CTGGCTA77AAAG7A7777C7G7TGT7GT77TTC 
7CT77GGC7G7T77CTCTCACAT7GCC77CTC7AAAGC7ACAGCC7CTCC 
777C7777C77G7CCC7CCC7GGT77GG7A7G7GACC7AGAA77ACAG7C 
AGA777CAGAAAATGA7TCTCTCA777TGCTGA7AAGGAC7GA77CGT77 
7AC7GAGGGACGGCAGAACTAGT77CC7A7GAGGGCA7GGGTGAA7ACAA 
C7GAGGC77C7GA7GGGAGGGAATC7C7AC7A7CCAAAATTA77AGGAGA 
AAA77GAAAA777CCAAC7C7G7C7C7C7C77ACC7C7G7G7AAGGCAAA 
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TACCTTATTCTTGTGGTGTTTTTGTAACCTCTTCAAACTTTCATTGATTG 

AATGCCTGTTC7GGCAATACATTAGGTTGGGCACATAAGGAATACCAACA 

TAAATAAAACATTCTAAAAGAAGTTTACGATCTAATAAAGGAGACAGGTA 

CATAGCAAACTAATTCAAAGGAGCTAGAAGATGGAGAAAATGCTGAATGT 

GGACTAAGTCATTCAACAAAGTTTTCAGGAAGCACAAAGAGGAGGGGCTC 

CCC7CACAGA7A7C7GGA77AGAGGC7GGC7GAGC7GA7GG7GGC7GG7G 

77C7C7G77GCAAAAG7CAAGA7GGCCAAAG77CCAGACA7G777GAAGA 

CC7GAAGAAC7G77ACAGG7AAGGAA7AAGA777A7C7C77G7GA777AA 

7GAGGG777CAAGGC7CACCAAAA7CCAGC7AGGCA7AACAG7GGCCAGC 

A7GGGGGCAGGCCGGCAGAGG77G7AAAGA7G7G7AC7AG7CC7GAAG7C 

AGAGCAGG77CAGAGAAGACCCAGAAAAAC7AAGCA77CAGCA7G77AAA 

C7GAGA77ACA77GGCAGGGAGACCGCCA7777AGAAAAA77A77777GA 

GG7C7GC7GAGCCC7ACA7GAA7A7CAGCA7CAAC77AGACACAGCC7C7 

G77GAGA7CACA7GCCC7GA7A7AAGAA7GGG7777AC7GG7CCA77C7C 

AGGAAAAC77GA7C7CA77CAGGAACAGGAAA7GGC7CCACAGCAAGC7G 

GGCA7G7GAAC7CACA7A7GCAGGCAAA7C7CAC7CAGA7GTAGAAGAAA 

GG7AAA7GAACACAAAGA7AAAA77ACGGAACA7A77AAAC7AACA7GA7 

G777CCA77A7C7G7AG7AAA7AC7AACACAAAC7AGGC7G7CAAAA777 

7GCC7GGA7A7777AC7AAG7A7AAA77A7GAAA7C7G7777AG7GAA7A 

CA7GAAAG7AA7G7G7AACA7A7AA7C7A777GG77AAAA7AAAAAGGAA 

G7GC77CAAAACC777C77T7C7C7AAAGGAGC77AACA77C77CCC7GA 

AC77CAA77AAAGC7C77CAA777G77AGCCAAG7CCAA77777ACAGA7 

AAAGCACAGG7AAAGC7CAAAGCC7G7C77GA7GAC7AC7AA77CCAGA7 

7AG7AAGA7A7GAA77AC7C7ACC7A7G7G7A7G7G7AGAAG7CC77AAA 

777CAAAGA7GACAG7AA7GGCCA7G7G7A7G7G7G7GACCCACAAC7A7 

CA7GG7CA77AAAG7ACA77GGCCAGAGACCACAC7GAAA7AACAACAA7 

7ACA77C7CA7CA7C77A7777GACAG7GAAAA7GAAGAAGACAG77CC7 

CCA77GA7CA7C7G7C7C7GAA7CAGG7AAGCAAA7GAC7G7AA77C7CA 

7GGGAC7GC7A77C7TACACAG7GG777C77CA7CCAAAGAGAACAGCAA 

7GAC77GAA7C77AAA7AC7777G7777ACCC7CAC7AGAGG7CCAGAGA 

CC7G7C777CA77A7AAG7GAGACCAGC7GCC7C7C7AAAC7AA7AG77G 

A7G7GCA77GGC77C7CCCAGAACAGAGCAGAAC7A7CCCAAA7CCC7GA 

GAAC7GGAG7C7CC7GGGGCAGGC77CA7CAGGA7G77AG77A7GCCA7C 

C7GAGAAAGGCCCCGCAGGCCGC77CACCAGG7G7C7G7C7CC7AA7G7G 

A7G7G77G7GG77G7C77C7C7GACACCAGCA7CAGAGG77AGAGAAAG7 

C7CCAAACA7GAAGC7GAGAGAGAGGAAGCAAGCCAG77GAAAG7GAGAA 

G7C7ACAGCCAC7CA7CAA7C7G7G77A77G7G777GGAGACCACAAA7A 

3ACAC7A7AAG7AC7GCC7AG7A7G7C77CAG7AC7GGC777AAAAGC7G 

TCCC Z AAAGGAG7A777C7AAAA7A7777GAGCA77G77AAGCAGA7777 

7AACC7CC7GAGAGGGAAC7AA77GGAAAGC7ACCAC7CAC7ACAA7CA7 

TG77AACC7A777AG77ACAACA7C7CA77777GAGCA7GCAAA7AAA7G 

AAAAA7C77CC7AAAAAAATCA7C77777A7CC7GGAAGGAGGAAGGAAG 

G7GAGACAAAAGGGAGAGAGGGAGGGAAGCC7AA7GAAACACCAG7TACC 

7AAGACCAGAA7GGAGA7CT7CC7CAC7ACC7C7G77GAA7ACAGCACC7 

AC7GAAAGAAC777CA77CCC7GACCA7GAACAGCC7C7CAGC77C7G77 

77CC77CC7CACAGAAA7CC77CTA7CA7G7AAGN7A7GGCCCAC7CCA7 

GAAGGC7GCA7GGA7CAA7C7G7G7C7C7GAG7A7C7C7GAAACC7C7AA 

AACA7CCAAGC77ACC77CAAGGAGAGCA7GG7GG7AG7AGCAACCAACG 

GGAAGG77C7GAAGAAGAGACGG77GAG777AAGCCAA7CCA7CAC7GA7 

GA7GACC7GGAGGCCATCGCCAA7GAC7CAGAGGAAGG7AAGGGG7CAAG 

CACAA7AA7A7C777C7777ACAG7777AAGCAAG7AGGGACAG7AGAA7 

77AGGGGAAAA77AAACG7GGAG7CAGAA7AACAAGAAGACAACCAAGCA 

77AG7C7GG7AAC7A7ACAGAGGAAAA77AA77777A7CC77C7CCAGGA 

GGGAGAAA7GAGCAG7GGCCTGAA7CGAGAA7AC77GC7CACAGCCA77A 

T77C77AGCCA7A77G7AAAGG7CG7G7GAC7777AGCC777CAGGAGAA 

AGCAG7AA7AAGACCAC77ACGAGC7A7G77CC7C7CA7AC7AAC7A7GC 

C7CC77GG7CA7G77ACA7AA7C7777CG7GA77CAG777CC7C7AC7G7 

AAAA7GGAGA7AA7CAGAA7CCCCCAC7CA77GGA77G77G7AAAGA77A 

AGAG7C7CAGGC777ACAGAC7GAGC7AGC7GGGCCC7CC7GAC7G77A7 

AAAGA77AAA7GAG7CAACA7CCCC7AAC77C7GGAC7AGAA7AA7G7C7 
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GGTACAAAGTAAGCACC^TAAATGTTAGCTATTACTATCATTATOAk 2 

ATTATTTTATTTTTTTTTTTTGAGATGGAGTCTCACTCTGTTGCCCAGGC 

7GGAG7GCAG7GGCGCAA7C77GGC7CAC7GCAAGC7C7GCC7CC7GGG7 

TCACGCCATTTTCCTGCCTCAGCCTCCCGAGTAGCTGGGACAACAGGCAT 

GTGCCACCATGCCCAGCTAATTTTTTTGTATTTTTAGTAGAGATGGGG7T 

TCACTGTGTTAGCCAGGATGGTCTCTATTTCCTGATCTCATGATCCGCCT 

GCC77GGCC7CCCAAAG7GC7GGGA77ACAGGCG7GAGCCACCGCGCCCG 

GCCTATTATTATTATTATTACTACTACTACTACCTATATGAATACTACCA 

GCAATACTAATTTATTAATGACTGGATTATGTCTAAACCTCACAAGAATC 

CTACCTTGTCATTTTACATAAAAGGAAACTAAGCTCATTGAGATAGGTAA 

ACTGCCCAATGGCATACATCTGTAAGTGGGAGAGCCTCAAATCTAATTCA 

GTTCTACCTGAGTAAAAAAATCATGGTTTCTCCTCCATCCCTTTACTGTA 

CAAGCCTCCACATGAACTATAAACCCAATATTCCTGTTTTTAAGATAATA 

CCTAAGCAATAACGCATGTTCACCTAGAAGGTTTTAAAATGTAACACAAT 

ATAAGAAAATAAAAATCACTCATATCGTCAGTGAGAGTTTACTACTGCCA 

GCACTATGGTATGTTTCCTTAAAATCTTTGCTATACACATACC7ACATG7 

GAACAAATATGTCTAACATCAAGACCACACTATTTACAACTTTATATCCA 

GCTTTTCTGACTTAGCAATGTATTGATGACATTATGCATGCT7AGACCTC 
C 

>Concig34 

G7A77C7A77C7CGG77A7AACACAA7CACAG7GA777G7CA7ATC777C 
CAGGA777G77AA777CAC77C77CAGC7G777CCCCC77G77GGC7GGA 
AC7GATT77CTA7C77C7GGGAGAA7C77CAGCAAGCCAAC7CAGGA777 
G77GGG7GCA7777G7CAAG7C7AGGACCCAGGC7C7GGG7GAC7GA777 
CC7C7AA7TACCGAGCAA7G7AAAA7GAGGAAG7C7GA77G7G7AAAGG7 
G77AAAC7T77G7G7GACGGCAAAAC777AA7ACCA7GAA7AGAGA77CC 
AGAA7T77CCAAC77C7AACGGGA77CC777CAC7CCC7GACA77AGAA7 
G77AGAAAA7C7ACCACAAAACA7C7G7GAGGC7A7CC7ACAAGGCCCG7 
7777CAAAA7AGG77777ACAAGGA77GC7A777GGGA7GA7AG777CAG 
AAAGGCGC7A7CAAAG77AA77GA7GA7G7G7GCAAGC7GAAAG77A7A7 
G77AGAAC7AGCAG7GATT7CAAAAA7A7CCC7777AGGC77777GC7AA 
7A7A7C7GC7CA7777CAAAG77CCCAA7A77A7AAAAC77777AAAGCA 
GAAAGAAGAACCC7CCA777C7GC7GGCCCC77CCC7G77CAAC7AAAAA 
G7A7777CCCAGGCAA7GCTA7CCCAGGAC7CACAC7CCA7CCA7CCA7C 
ACC7ACCA7AAG77C777GAAGGGC7CA77C7GAGCGC77CC7GAG7GCC 
TGGGA7C7G77A777C7C7CCA777C7GC7GC7GCA7GG7AG7CCAAG7C 
C7CC7CCC7777CCCC7AGGCCA777GAA7CA7C7GC7AA77GG7777CC 
TGA7TGCCACGGAAAC77CC7CCA7CCC77CC7CACA7A7CAGCCACAGA 
AG7A7C7CCAAAAAGCAAA7C7GG7GACA7GAAGCCC77GCACAAAACCC 
A77CA7TAC7GG77CCACACC7CC777G7GGA7AAG77CAAGC7CC7GAG 
7G7GGCAAGCAGGGCCCACC7GGAA7CCCC7GCCC7CC7C7CC7A7CCCA 
CGCA7CAA7C777CC7G7C7A777GCAG77CC77GAA7G7GA7A77C777 
C7AG7C7C7G7GC7777GCA7AACC7G77C77CC7GAC7GGAAAC7CC77 
C7CC7CC77G7AG777GGCTAA777C7AG7C777CAAGAC7CAGC7CA7G 
C77CACCCCC7C7A7AACAAG7CC777CCCAAGC7GGG7GG7GGATGC7C 
C7C7G7GC7G7G7GAG7C77GAACA7CC7CAGCAAACC7CAGC777G777 
GC77G7C7CCC77GC7G7CAA7GCACC7GA77CAGGGC7GGCA7A7AC7G 
77CACC7CCA7GAC7GGC7CA7GG7GG7GC7CCG7GAA7A7CA7CCACCC 
AAACGGA7GAGAGC7ACCA7GCCA7CAC77G7GAC77CCA7C7GGAGC7A 
ACC7CCCCCGACAGGAAAGCG777CC77AGGAAAGAA7A7C777GGG77A 
AA7AGAAG7AGAGAC7CACCAGAAGCAC7A7G7CCAGC7CAGAA7GAAC7 
GC7CAG7AAGCAGCC77G7CAA7GAGGAGGCAGCAGGCCAGCCCCAGAGG 
CC7CAAAG7GGGAGAG7AGAGAAGCGCAG77CC7GCCACAAAGGCACAG7 
GGACACC77GC7CCCC7GGC7GGC7GGAAGCAGA7GG7G7CCACC7GC77 
CCA7GGGAA77C7GCACC777AA7AAAG7777A7GGGACAGGAAGG7GAC 
7GGCA77GACA77G7AACGAGGAA7GGG7GG7GCCACC777GC7G7G7C7 
7ACCAGAAA7ACC7G7GGCAGG7AAA777C7AGAGAGACCC7CCCA777C 
7CCCA7A7AGCAA7777GAAA7G777CC7GAGGGC777CCAAA77CA7C7 
GGGAACA7AGGAG77CCAGAAAGA7GAAA7CAAAGG7GA7GG7A7GCCAA 
AGAAAG7AGC7T77AGAA7GAC7TACA77AGCCA77CA7CCA77CAGCAC 
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ACCAGGCATTCAGTTTGAGGGGTGTGTGTGTGTGTGTGCGCGCGCGCGxG' 

CG7GCA7GAG7GCA7GCGCGCGCG7G7ACA7AGGGGAAGGGAAACAAAAC 

AAAAGTACACAAGACATGATAGTTGTCCTCAAGGAGTTTTTGCAAATG7T 

CACAATTTAAGAGAATATGCTG7GCTGTGGCTGGTGTATAAACCAACTGC 

7AGGGAGAGGCC7TCCACACACAC77GGGGCAAA7GCGACC7C7AGGAC7 

GCCAGTGGAATCTGGGCATGCTGTTTGTGGTCGATAAACCCTGGTCCCTT 

GATCAGGGACCTATGTTTACTTTTCCTCTCCCTGGAAGTCTTCATTAGTG 

GGCATCCAGAAGGTCTTGCACAGGGCAGAGGGAGGCACAAAGACAAGAGT 

TTGAAACCAGCCTGGACAACAAAATGAGTTTCTATCTTTACAAAAAAAAT 

TTTTAAAAAATTAGCCAGGTAGGATTGCATGTGCCTGTAGTCCCAGCTAT 

TCAGGAAGCTGAGGCAGGAGGATTCCCTGAGACCAGGAATTTTGAGGC7G 

CAGTGAGCTATTAAGTTGGCGCAAAAGTAATCGTGGTTTTTATCATTAAA 

AGTAATGGCAAAACTTTTAATGACAAAAACCGTGATTACTTTTGCACCAA 

TTTAATATGATTGCACGACTGCACTGTGCTCCAGCCTGGGCAACAGAGTG 

GGACCCTGTCACAAAATAATAAATAAATAAAATGTAAACATGTAAAAAAA 

ACC CCAAAAACAAAAAAAATGGGTGTTGAGACCCCTGAATT5AGGAATAA 

TAGGAAGGAGTGTGATTCTGTGTGTGCATGCATGGGTGTGCACCCTCAGT 

GCCTGGGTGGCTTACCCTGGGCTAGTTCAGGTGGCAAATGGTTTTCCTCC 

AGCTGGGC7ACCACCATCTTCCCCCAGGGCCTGTCCATGTATTTGGTGGC 

AAGATACCTATGGACTAGAGTCCCTCCTCAGAGGAAAGGCTCC7CCCATT 

TCTCTGGCTTTCAGGTAGTAGTCCATGACTTCAACAGGTCCCCAGTGCAA 

TGTTATGGG7TAGTTTAGGTGGGGTCTCCTCTGAGAGCCTCCCATAGCCC 

AAAAGGCCC7G7CC7AGC7GGCAC7GCATC7CCC7C77CCCAGC7C7CAG 

CCT77C7C7TTGC7CA7CCCAG7CCGCACAGGC777C7GCC7GA7CC77G 

GA7G7G7CAA7CC7GCCCC7AAGGGA7GCAAGGCAA777G7CC7T77A77 

A77AAGA7C7C7CC7GAGGCCACG7G7GG7GGC7CACACC7G7AG7CC7A 

GAAC777GG7AGGCCAAGG7AGGAGAA77GC77GAGC7CAGGAG77CCAG 

GCTG7AG7GAACCA7GAT7GCACCA77GCA77CCAGCC7G7G7GACACAG 

CGAGACCC7G7C777777C7T7777777T7GAGACAGGG7C7CGC7C7G7 

CATCCAGGC7AGAG7GCAGCGG7G77777C7GCTCAC7GCAGCC7CAACC 

7GCACA77T7T7G7AGAGACGG7G7C77GC7A7G77GCCCAGAG7GGCC7 

CAAAC7CC7GGGC7CAAGAGA7C777CCACC7CAGCC77CCAAAG7GC7G 

GGAC7ACAGGCG7GAGC7ACCGCGCCCAACAAAGACCC7G7C77AAAAAG 

AAAACAAAAA7AAACAAC7CCC7CAAG7C777777777777TTGAGACGG 

AG7C7CGC7C7G7CGCCCAGGC7GGAGGGCAG7GGCGCAATC77GGCTCA 

C7GCAAGC7C7GCC7CCCGGG77CACGCCA77C7C77GCC7CAGCC7CCC 

GAG7AGC7GGGAC7ACAGG7GCCCGCCACCACGCC7GGC7AA7A7777G7 

A7T777AG7AGAGA7GGGG777CACTGCG77AGCCAGGA7GG7C77GA7C 

TCC7CACC77G7GA7CCGCCCGCC7CGGCC7CCCAAAG7GC7GGGA77AC 

AGGCA7GAGCCACCGCGCCCAGCCAGACC7C77GAG7C77AAAC7CC7C7 

G7AG77CCAGCCACCC777AGCACATGAC7C7G77AA7777G77C7CAC7 

G7C7GAAA7CA7C7CC7G7CCAC7CT7GAC7GACAGG7C7C7GCAC7AGC 

CCACTGCT7AA7CAGAGTAGG7CCCTG7CAAC77AT7CA7A77G7G7CCC 

CA7GCCAG7G7GGA7GAT7AAAA77G77GAG7GGAGGCTGA7CAGA7GAG 

CCA7C7CC77CCAAG7CC7CAC7TGC7GGC7CCTG7C7TAG7777AG7CC 

CCA77CT7CAAAGAACG7GAGCCC7GGAAAG7AT777AG7CA777AG77C 

AG7GCC777GGATGGGAGGA7CACATCCC7GGG7CCCG7CC7GCAGAC7G 

7777GC7C7AGC7GAC7AGGCAGGAT7CCCTGCC77C7C7CAC77CGGCA 

TGGGAC77CC77C7GAAA77GC7GC7CAG7CAAGAGAA7GACC77CCCCA 

ACA7AA7CC7ACTCCACAGGGAC77AAAGGTG7G7CAGAGA7C7C77GC7 

CA7C777C7GGCCAGG7GCCAACG7CAG777A7AGCCAAGGGACAAGAC7 

AG77AGCAGA7CAGGCAGG7C77AGACCCCAGCG7AAG7GCCAGACTTC7 

AGC7GCAG77G77CC7GCCCACAC7GGGCG77CAGG7GGAGAGAGGGCA7 

GGCAC7ACAC7GAGC7CTCGGCGAAACCCAGGAC7C7GAAA7C7CGG7G7 

CAGCCACAGGCCACTC7TTTCAGCAGGAC7TCAG7CAG7CC7G7CAC7AG 

GC7G7CGAGCACA7GG7AGGC7T7ACCCC 

>Contia35 

AAGGAG7G7GC77GC7GA7AGCA7G7G7GANGGGACGAGGAG7AAA7AA7 
T7C7GCC77CAAGAAA77GCAAAC7AG7AA7GGAGA7AAAA7CAACAGAG 
GAACAA77AGAG7ATAAGG7AAAA7C7AAGGGCCA7AAGAGAGGAGAAGA 
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AG7A7GGGAG77CAGAG<aTAGGGGG7AAA7GAGGGGAG7AGG7GGGTA ; GA 

AAAGGTTAAAAGTAAATAATGATGGGAAGGAAGACAAAAAGACGACAGGG 

GTGCCAAAGGACTCTTAACCTCATCTGAACGGAGTTGCCCTGTTTTGCTC 

TCTGATGCTCATGTATCTATCCTTAGAGACAGCTTGGCGGGCAATGTAGA 

GCG7AGGGGC7GACA7AGGGGGT7GGAG7CCCACC7CCG7GAC77C7AGC 

AAATTAGCAAACTTTGCTGCTGCTAAGCCTATAAGGCGGACAGAAATGCC 

ATCTTTAAAGCTTGTTATGTAAAGTGCCTAGGACCTCGTAGGCATCAACA 

GGAATAATGGATGAAACAAAACAACGGTGCGTATCTTGGAGAAAGTGGCA 

TCTGAGCAGGAGTATTTTGAAAGGTAGGAAAGGGCTCCAAGCACATCTAA 

GAGA7TAGGGAACGCAGAAGCCTTAGCCCTGGGTGCAGATTTAACCAATC 

AACTTCTAACCACCGCAGGCTGAGAGGTGTGGAGTGAGAGCCCCGCCAGA 

3GCAGGAGACCCGGGCTTCGGCCAGACCCCGCCTCCTGGTACAGAGGACC 

ACGCCCGGCTCTGCCTGGAGCCAAATGTGGATCAAAACAGCGCGCAGCTT 

CCCACTGC7GGTGAAAACCCGAGCAAGGGGCCTCAGTTTCTTTATCCGGA 

ACG7GGTGACAATGACATCTCTTTGCAAGGCTGCTGCAGGGCTTTCTGGA 

AATACGCCCGTGAGGTATCTGGGCCTGCGCACAGCCTCCCCCGCCCAGGA 

CCCAGACGTCTACCTGGGGGTCCCGTCTGCGCTCCCGGGATGGAAAACGC 

CCAGGGGAAACTTAGGCAGGCGAGCGGACGGGCACCTCCCGCGGGACGAA 

CTCAC7CGGTGGCCTCCTACTTCCCCGGCCGTGTTCCAACGCCTGAGAAT 

AACGGGAACAGCGG7CG7AC7CACCGACAGCGGCAGCAGCGG7AGGCCCG 

GGCCCCACCA7GAC7C77CAG7GACAG77777C77CAAACGCCGCSCC7G 

TAGCCAGGACCGGCG7GCCGCGCG7CCACGCG7CC7CArrGGC7CC7GCG 

GG777GAAAC7CGC7AG7CG7CAGCACGGGAGGGCGGGACAACAGGCAA7 

AGGC7C777GCGG77GGC7C7GGCC77GAGAACCCGACC77GGGGCCC77 

7GA77GGAAGAACG7GCAGCGCACC7CGGCA77GAGGGCGGC77CC7CGG 

GGCGCGGCGCCGCCCGCC7CTGAG7GCGCC7G7GAG7GCGCC7CCGAG7G 

GGCG7GGGACCC7CCGTGGGGGCC7CAGCCGGGC7GG7GG77GGGGGGCG 

GT7ACGC7GAA7CCAGC7GGGG77GGCGCGCCGGGAG7CCC7GGGCGGAG 

AGACAGGGCGG7CCTCCCAGGATGC7GGGGCCGC7ACC7GA77CTGTCC7 

7TCAAAG7C7CAGAC7CACAGGAGC7G7GAAAAAA7AA7A7TA7AAAGAG 

GACA7A7GGG7C7TATGCATC7AAAGGC7CC7AG77C77AGTAC7GCAGG 

G7GGC7CG777AA77G7GG7AAAATATGCA7AACA7CACA7A7ACCA777 

TAACCA7777AAAG7G77AAA77777CAAAAA7G7GCAG777AG7GG7A7 

7AAG7ACCC7CACATTG7GGCACAGCCACCAC7AC7GTCCTTTCCAGAAC 

77777CA7C77CCCAAA7GAAACCC7G7ACCCG7CAC7AAC7CCGCAC7C 

C7CCC7CCCCCAGCCCCAGGCAA7CACCA77C7AG77TC7G7C7C7A7GG 

A777GACAAC7G7AGG7GCCA7A7AAG7AGAA7CA7GCAG7A777G77C7 

37GAC73GC77G77TCAC7TAGCA7AAAG7A77CAAGG77CA7CCA7G7G 

TAGCA7G7G7CAGAA777CC777CC7777AAGGGGGAA7AGCA777CG77 

G7G7GGAGA7GCCACAT777GCrrC77GG7CCA7CCC7CTCCGGACAC77 

GAG77GC77CCAC7T7T7GGC7A77G7GAA7AA7AA7A7GAACA7GAA7G 

CACAAA7AAC7C77TGAGAC7C7CC7777CA77C7777GGG7A7A7ACCA 

CGAAG7GG7A77G77GGA7CAAACGGCAA77C7A77777A A7 7 777 7GAG 

AAAC7GCC77AC7CCTC7CACGG7GA7C7C77G77CAAGG7A7AT777CG 

A777CACC7GA7CAGC7GAC7ATAAGGCCA7AAGGC7AACGGAGAAACGC 

AGGCC7AG777C7CCTAGTTAC7AGGAGATCGCAGGCCTCGT7GTCCTGA 

A7CCC7AGACACACTTCAT7CCCC77G7777AA7CC7AAA77777777C7 

T77GAAG7T7G7CC7GT77CA7C7A77C7CCAG777C77AAAGAGG7C7G 

GAAAA7GC7777GGCTCCTTG7G7A7GAAGG77CC7C77CCA7GGA7GC7 

GGAGAAG7CGTGTGTGGAGGGGCAG7CA7A7CTGGGCACCTGT7GGCCAG 

G77CAGC77ACCAG77GGG7AC7CAGCAGGGCA7GAAGCCAC7GCAGCAG 

CCC77C7C777AGCCG7AAA7AGGGAG777GGAAGAGAGCCAGGG777C7 

GGA7T7A7GCA77TTGA7ATT77CAA7AG7G7A77AAA7G7TTAAAA7AG 

GAAAAC7GA7CATTATTTTTGTTAA7GAC7GAGAAAGGGACTCCTTCACC 

AACAG777CAGAAAAGTGAAGGCGG7777G7777GG7C777G7AGAA7C7 

AGG7GG77GAA7GCA7G7CAG77G7AGAAG7CACC77GCC7GA7A7CCCA 

CGCAG7GC7GGAG7A77CCACAGACCCCA7G7AGG7AC7GCACC777GCA 

GG7A7AC7GC7GG7GTTGG7GAGC7GCC77ACC7G7CC7G7TA77GGAGA 

CCCC7GC77A77AG GAA AC77AAAA7GAAC7CAAA7GAGC77CC77GC77 

AC7GG7CC7AG7CC77TGGAGCAACA7AGGCCAG77C7GCC7CG777777 
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TCCA7CCT77GGG7A77TGACGG7C7A7777G7AGGACACAAAA7G7GGG 

AAAA7AGC7AGGCAGG777AAAAA77C7CAAC7C7ACCAAGCA7GG7GGC 

77A7G7C7G7AA7CAA7CCCAGCAC777G7GAAGC7GAGGCAAGAGGA77 

GC7TGAGCC7AGGAG777GAGACCAGAC7GGGCAACA7AGCAAGACC7CG 

777C77AAAAAAAAAAAAAAAA77ACAAAAA77AACCAGGCA7GG7GGCA 

CACACCTGTAGTCCCTTCTACTCAGGAGGCTGAGGTGGGAGGATCACTTG 

AGCCCAAAAGTTGAAGGATGCAGTGCAC7GTGGTCATGCCACCGCACTCC 

AGCATGGGAGGCAGAGCAAGACCCTG7CTCCAAATAAATACATAAATTAA 

A77C77AAC7CA77CA7CAAAG7A7CCAC7G7AGC7T7CCA7CA7CC7GG 

7G77G77T77777AGAAGGA7C7GGC7CCA77GCCCGGC7AGAG7GCAG7 

GGCA7GA7C7CAGC7CAC7GCAGCCCCCACC7C7C7GGC77AAGCGA7CA 

CCCAC7TCAG7CACCCA7C7GGG7AA77777G7A777777G7AGAGA7GG 

GG7TT7GCCA7G77GCCCCAGG77GG7C77GAAC7CC7GGC7CAAGCGA7 

CCA7C7GCC7CCA7C7CC7AAAG7G77GGGA77ACAGG7G7GAGCCACCA 

CACCAGGACAA7CC7GG7GGC7777AACGG7777CCA77GC7C7CAGGC7 

AA7GACCTA7AAGCCCC7GCGGGC77GGCC7777AC7CCC7GAGCA77AG 

CCACC7CCC77AGCC77AGCCCACAC7AC7C7CCCC77GC7CAG7G77A7 

CCAGACAC777G777777CC777CCA7AC7CC7C7C7G7C7GGGAA7CCA 

ACC777C777C7CA777C7C7AG77GA77A77A77A77777AC7C7AGCA 

GCC77A77GAGA7A777ACA7ACCG7ACGA77C7CCCAC77ACAG7G7AC 

AA77CAA7T77C7AACA7777CA7CACCCCC7AAAGAAACCC7A7AC7CA 

T7AGCAG7CAC7CCCCA77C7CCCC7CC7C7CAGCCCC7AGAAACCA7GA 

A7C7ACTA7CCA7C7C7A7AGA777GCC77C7GGACA777CA7A7G7A7G 

AAA77A7GCAA777G7GG7C7C7GA7GGGC77C7777G77ACCAAAA7A7 

CA7GGG777GA7C7AGG7CC7GC7GC7CGC7GCACAGAAAGCCAGCCAC7 

GAGA7GACAAG7A77GCCAAGGAAGAAGGC777AG7CAGG7GC7GCAGC7 

GAGGAGA7GGGGGC7CAA7C7CAAA7CCA7C7CGC7GACC7AAAACCAGG 

GG777GGA7AGCAGGGAAGAAA7G7AACAA7GCG7AAGAAAACAGGAACC 

AGGGAGGGGCAAGGAAGCAA7CC7GA7GAA7GAG7GG7CCAAAG7C7CA7 

7GCC7GGA7G7GG7GA7C7GGCGAG777CAG77C777GA7AC7777777G 

AGAGGCC7GAAG7C7777CCCCAGGAAGGAAC7CAAACAAAACAAA7ACA 

AGC77CCAGC777AAGACCAGAAGCG7CAA777C7A7G777A7CCGAAAG 

AACAG7C7A7GGGAC7A7TGG77AAG7TTCAC777CAC7TAG7A7GC7G7 

777CAAGG777A7CCACA7AGCA7G7G7CAG7AC77CA77C7777A7GAC 

7GGG7A77C7A77G7GCGGA7A7ACAA7A7777A777GCCA77CA7CAG7 

7GA7GGACA7C7AGG77C777CCAC77777GGC7A77A7GAA7AA7GC7G 

TTA7GAAC7T7CA7G7A7AAG77777G7G7AGACA7A7G7777CAACAC7 

CA7GGG7A7A7ACC7AA7GAGAGGAA77AC7G7G7CA7ACGA7AA77C7A 

7G777AACCA777GAGGAAC7GCCAGAC7G7777CCAAAGCAGC7GCAGC 

A7777ACA77CC7ACCAGCAG7G7A7GAAAG77CCAG777C777ACA7CC 

7CAACAACAC77G77A77G7CCA7C7777AAA77ACAACCA7CC7AG7GG 

77G7GAAA7GG7A7CACA77G7GG77777A777G7A777CC77GA7GAC7 

AA7GA7G77AAGCA7C77777A7G7G777AC7GGCCA777G7A7A7C7C7 

A77CAGAG7CT77GCCAAT7777AAA77GGG7CAG77G7C77C77CC777 

777777GAGA7GGAGCC7CAC7C7G7T7CCCAGC7GGAA7ACAG7GG7G7 

GA7C7CAGC7CAC7GCAAC77CCACC7CC7G7G77CAAG7GA77C7GG7G 

CC7CAGCC7CCCAAG7AGC7GGGA77ACACGCACC7GCCACCA77CCCAG 

C7AA7777777C777G7AT7T7GAG7AGAGACGGGG777CACCA7G77GG 

CCAGGC7AG7C7C777G77GAC7C77AACCA7CC77CAG7C7CAGACAAA 

ACA7CCC777C7CAAGGA77G7GA77AGC77GA77A777GC77A7C777C 

7CCC7GC7AG7C7G7AAAC7GAGGG7AGGCCAC7A7A77CA77G77C77G 

GCACCAAA7AGAAAC7AAA77AA7G7C7777GAA7GAA7AGGGC7TTC7C 

C7777AAAGA7CCC77CAA7ACAG7AACCACAC7A7A7A7AAG7AGCCAC 

AAGCCCA77CAA7AA7AC7AC7AG7NC77GCGCCAAACC 

>Contig36 

GGC7CAGCG77AC7A7AC7GG7C7CAAAC7CC7GGGC7CAAGCGA7C7GC 
CCCCC7CGGC7TCCCAAAG7G77GGGA77A7AGGCG7GAGCCACGG7GCC 
7GGCC7CAAA7AAC7A777AAG7GAAACAAAAC7AG7A7GGCAC7AA7GA 
AAAA7GTA7AAA7CCA7AA7CGCAGAGGGA777CAAC77AC77C777CGA 
77A7G7AAAGG7CAAACAGACAAAAGACAA7GACAAAAC77AA7GCAA7G 
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AACACTTTTGATTTAATGAACATATATTGGATATGTACCCAAGAATTAGA 
3AA7ACA7AC7AG7777GAG777A7GCAGAACA777ACAAAAA777AG7G 
GAAGCC7AAA77A7AAAAAG77GC7G7CACG7AGAA7AACACACAAACCC 
37GAG7CCGGAA77CAAAGCCC7CCACAC7C7CC7C7ACC777GCA7C77 
7A7CCTCCACCACAC7GCAG7GCA7AC7C7GGGC7AC7AC7CAC7G77C7 
TGA77CAAA77CCA7G77C7G7CAGC7CAAA7CA77C7C7C7GCC7GGAA 
TA£C7AC77CA7ACA7A77C7GC7A77GAA77C77G7C77AGCACCCCA7 
CTAC7CCAAGACGA7G7CCAG77GGGG77AC7CCC7G7CCCA7777C777 
GA77ACAC777777777C7AC77CCA77A7A77A77GA7CACA7C7G7GC 
CACAG77777GAC777G7G7C7GC7777AC7C7777C7AGACCC7GA7AG 
C7CC7GAAGGG77GGG7CA777C777777A777GC7CA77CC7CA7GGCA 
CAG7GAG7GC77AA7AAA7GGC7A77GAC7GAAA77AAAC7G7A7C7AAA 
7GGACA7A77CCAC77C7GGGCCA77CA77C777C777C7A77GGAACCA 
GGAGA7GGGGAACCA7AACAAAGG7AAGG77G7GCCA7G7GAAAGAACA7 
GGAACC77CCCC7GAGGGCCAAAAAAGAGCAGGGAAAGG7GCAAAGACAA 
AA7C77CCA77777AAACAATGTAAGAA7G7GG7CCACC7CATGC7CAGG 
7GGGAC777A7CA7GACG77A77777GGGGAC77A7AGC7GCA7CA777A 
CCCCA7A7ACA777ACC777AG7G7AGGGAAC7GAGGACAGGAA7777G7 
7GA7GCAGAC7C77GC7AA7GAGGC7AACAC77GGAGAA77777A7CA7G 
CA77CAAGAAGC77G77TTACA777C77CA77AA7AC777AG77GG7GG7 
T7AGC777AG77G7AGGC77A7CAGA7A777GGAGA7A7C77CA7AAACG 
A7GGC7TTGG7777AGAAGAG77A77C7GAAGC7AC7A777C7GGCAA7A 
A7CAAACAGCA7GGCCA777G7777G7AAGGCC7T7CC7AGAA7A7GACG 
37AAAA7C7ACG7G7GGAAAAA7GC77A77C77C7G7CC7CTA7AAA7G7 
GAA7C7AG777G7C77CAAAA7GAAA7CAAG7GA77AAAA7G7AG7777C 
7AAGAAGA7AAA7GGAGCAAAGCAC7C7G7G777CACAG7G77GGAAA7C 
AC7CA7CCC7CA7AAAAC7GTCCCAAC7GA7CC7GAC7CACA7GAA7GAA 
77AAAA7AAGAG77AA7AACA7CAA777ACA77777AAAGACAC777CCC 
A7G7777AGAC7A77GG77GGAAAAGC7GG7AGG7G7ACAA777G7GGAG 
AGTTGGC7G77777G7C7G7CG77G777GACG7A777CAAAGCCA7A7C7 
AATT77G77GCAGAA7GG7CTGAA77C7ACAAAAA7G77GAG77G7G7AG 
7G7GGAGAAG7ACGGAGCCA777AC7GAAAGGC7GGGGGGAAA7GACGAG 
ACCC7GAGA7AAGGCAG7AG7GG7GCGAACAGAG7GGAAGGGAGG7AG77 
GAGA7A7G77CAGAG7AGAATCAGAA7GGACA7AG7GAACAAC7GGA7GC 
AGG7GGGGGC7GAGGAAGCAAAG77GAGGA7AA77C7GAGAC77C7AGG7 
TGA7CCAC7GAAG77ACA77A77CAACACCACAAGGAAAC7AGGGGAA7G 
AGAAGGCA7AC7GG777GC7T7GGAG7GGAAGGGCAG7GA7G7AAGAGGA 
377AA7GAG77AAAG777GGA7A7GCC7GAAC77CAA777GA7A7G7GCA 
TC7GA7A7ACCC77GGGG7GACCC7CCAGGCAA7GG77GAACA7G7G7A7 
T7C77AG7AAC7GA7AGGCATCACAGAC7CACA7CAG7AAGGAAGCAACA 
GCAAAC77GA77GGACGA7ATACC7GGAAC7CAG7ACCC7A7GAC7GGAG 
CAAG7C7C7G7CAG7GAAA7GAGGA7AAGAAGAA7C77GACC77G7GGAA 
TATG77G77AGGAA7A7A7G7GA7GAACAACA7AGGA7AC77CC7ACAGG 
GC7CCACA7G7AG7AAGGGCT77ATAAA7GC77GA7AAA7A77A77G77G 
7AATTTA777CCAAAG7AAGA7GCCAC7GGAGGAA7C777GGAACCCAAA 
T7AA7AACAAA7AGGAC7GGA7GCAA7GGC7CACACC7G7AA7CCCAGCA 
C777GGAAGGCCAAGGCAGGAGGA7C7C77GAGCCCAGAAAT7CAAGACC 
AGCC7GGG7GACACAGGGAGACC7TG7A7C7A7GAAGAA77AAAAAAAA7 
7AACCAGA7G7GG7GG7GCACGCC7A7AG7CCC7GC7GC77GAGAGGC7G 
AGGTGGGAGGA77GC77GAGCCCA7GAGG77GAGGC7GCAG7GAGCCA7A 
A77G7GCCACCACAC7CCAGAC7GGG7GACAGAG7GAGACCC7A7C7CAA 
A7AAATAAA7AAATAAATAAATAAA7AAGTACAAACCAGCAAACAC7AAT 
CC77TC7AGAGA77AT7GAAC7C7GGAGGGCAGA7C7GAA7GGAGCCAGC 
AGAGGGACC7A7GGAGA7CAGCC7GGCCC7GGACAGCACCAGGCAA7GGG 
G77GC7AGAGAGG7AA7GGGG77GAACAGGG777AAGCCA7GAGG7C7CA 
AGAA7CCG7GAAGAC7CAGAC7AA7777777777777GCA7GAGGA77AG 
G7G77CC7AGGAA777CAA7GAGAGCAGGG77AA7GAAGGAA7GCAGGG7 
AGGAGAGC7GAGGGAAGGCATC7GAGAGAGCC7GGC77A7GAA7GGC7GC 
37CAG7A7GGC7CACC7GC7T7CC77G7A7C7AC77AGCAGA7GA7CCCA 
CCCCAGGCC7CCAGGGCCAAGG7CA777CCACA7AG7CA7GGGCCC77GA 
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GGGCCTGGAGCAGTGTA^^GAAGACAGAGTCTTAAGAAATTGCA^MiL * 

G7CA7GG7GC77GGCAAG7G7CG7CATCCTA7GCCAAGCCTGA7C7GAAG 

GGGTGCATGCTCATAGGTAGCTGCTGCCCAAGATTACAGCAGCTTCTTCA 

ATCCCAGATCCATGCTCTCCTATATTCATTTTTCCAGGGG7TCCTGTCCT 

7CGACAGTGATGAGATGCAGAATGACTTATTGAGTTATTCTCCTGATAGT 

TGCC^ACTTTTCCAAATGACAATGGGGCATGGAGCTTGAGAGTGGAAATG 

AGGCCC7AGGGATAGCG7GC77AGGAAAACAC7CCCAGCC7GATGTAA77 

C7GGGGG7ACAA7GGCA77T7CA7CA7CAAGAC7GA7G7AAAGGG7GAC7 

AGCAG7GAG77GGGGGTGAC7CGCAC7GGGGC7AGG777C7GA77C7GCC 

7AA7CCAGACAGAGCAGAAGCACTAGTGGGC7GGTAGAGGGCC7CCAGGG 

CCTCAC77AATGTCCTGGAAAAACAGCTCCAGATTGTTGGTTCACGTTC7 

GAGGACAAGC77GGG7AC7ACAGGA7AGAGAGAG7GG7GGGAGA7GCCG7 

GGCC7GCCC7GCTGATGCC7GCCC7GCCA77CC7GCG7GTGA7G7C7C7G 

GGGCA7C77GCCTTCCCTGCCCAGACCTGTAGTTCAGC7GAGGGCATG7G 

GAGGCCAAA7GGCTTCTTAGAG7GTTACTTTCCTTGAACAGCTC7GC7GG 

GAGAAC7GGAGGAGCTAGCTAG7CACGG7AAC7GCAGCAG7CAAAGGA7C 

G7CCCGG7GGAGGTGGGG7GGAAAGG7AGAGAAAGAGAACA7A7AGCG77 

7TCC77GGAGATGTGTGGGCATGTCATAGAGGAAA7ACCCAA7TCC7GAG 

CCT7GAGCCCTCCAGGAAACCTTGGAA7A77AGG7TAG7CATCCCCAAGG 

AAG7C7AAGAA77C7GG7C7CACCCA7C7CC777AA77CCCACAA7GA7C 

C7ACA7GA7A77AAGGAACACGGGCCAG7AACCC7CCAAGCAA7GGA7G7 

3G7GG7GAAGT7TGACC7CA7GATGGAGCGGAGG77GG777GAAACC7AA 

3AA777AA777AT7G77TCAAACTG77C7CCAC7CAGCG7TA77AAAGCA 

7ACA7AA77GACACA7AAAAA77G7A7A7G7C7ACGG7G7ACAA7G7GA7 

G7T7CGA7C7A7GTA7ACA77GTGAAA7GA77ACAACAAGC7AAA7AACA 

7ACCCA77CA7CGTGT7TCAAAGGAA77AAAC7CAAGCACAAAAGAGAGG 

7GC7G77GAAGAGTAGGGC7GC7C7A7C7AAG7AG7ATGTC7GGGG77G7 

CCTGGA7CAGGG7CCT7T7G7GC7AG7AA7AAACCAGCCC7TC7GGGGC7 

GCTCCAC77TCCCCACAT77TC77C7GGAGCC7CCC7AAGAATTAGGACA 

TGGCCAC7TTCTCTGCATAGGCTTCCTACTTCAACAAGGACAGGGCTTG7 

GCTGCCCCA7GCCACTTGAG7GTCCC7ACAGCACAGAGCTGAGTGGACAC 

7GGC7GAG7GAGGAAA7CCCCCAGAT7AA7C77GGTTC7AAGCATCATGG 

C7G7A777CACACG7A7A7GAA77ACAAA77ACAGCA7AG7CGAA7AAGG 

A77777G7GC7ACAAC7GGAA7CCCAGA77A7GCAAA77GGA7AG7A7AA 

7ATTGAAA77CCTAGGACTTT7TATTAGT7T7AAAAAATTATACAAGC77 

AGAG7AAGAAA77AAACAG7GCAAAAGAA77CAC7GTGAAAAG7AAAA7G 

C7C7G7C7C7GC7GAGAGACAGATA7TGCAGCCCAGA7AC7AC7GGGG7C 

AA7AG777CC777AAGCATGCCA7777GA7GG777A7GGGAC77ACAGC7 

CAAGAAGC77GACAC7AGGG77GA7C7CAGAAAA7CA77G77GCAGG7A7 

7AGA7A7GACCG7C7CATAAAGA7ACACACACAGACACAGCGA77GGAGA 

7A77CAC7GGGGC7TATGGGC7GC77GTCCT77C7GC7C7G7GCCTAAG7 

7GGGC7CAGAGTAGCC7GGCA7CGGCTGTGGGGAGAATGCTGGCATGGGG 

7TAGCAGGAGCCCACT7AACATGTCCTAAGCCACC7GGAAGAGTCC7TCA 

AGGAGACCAGAC7CCAGAGGCCCTAAGGAAGGAAGGAC7TTTGCCCGTT7 

77AGG7A77C7AG7CCCAGAG777AGGGAGGAA7GG777GGC777GGG7C 

GTGTGCCCCTTTACCGAGTGGGATGGGATGTGCCCATGAGCTGTTGAGC7 

GGC7C77GGAGAAGACAGCAAAAGCGGGAATAAGAGGTCAGGAAGCTGTG 

7GG77G7AGGAAATCCCAGCAGAGGGCC7GGGGG7CAAAAG7GG7CATGG 

7AG7GACGG7GGAGGCTGAGGTGG7AGAAAATCAGAGGACAAACCCCATG 

GGC7GC7GG7GATCTGACCGAGC7CCTATGCTCTCC7GG77CA7TTTAGG 

CTC7G7AGCAGCAGATGATTGGC7GG7GTGAGAGCAGTGCACCTGCCATA 

7CAGGCAA7CCAAGACAAG7CCAAGCTACGCTGGGAGGAAACC7GAAGGC 

AGCAGCAGG7AGACTGGCTGAAGACAGACAGGCAGGCAAC77G7CAATCA 

GATT7G7G77TTTAAGGACT7TTAAC7GGGGAGCCC7CCGGGACAGATCA 

GATGAGAG7GAAATGTGCTCCGCCTTAGCC 

>Contig37 

GGCCG77CGCAA77C7G7AAAAGGGAGAG7GG7777A777A77777AAAC 
A7AG7CAAGC7GC7AAAGTATA7GA7A7G7A7AGA7AGAG7A7AAT7AAA 
7AC777CAAC7ACAGACAAAA7CAGGAGAA7GGAAT7AAAAAACAAT77A 
GAAA7GGG7AATGGCAGCATTGGG77GCGCCCACCCACGAGAAGGCAGAC 
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ACCAAGA7TCTAAGA7C«^CGTG^^ 

TTC3TGATTATGCATTATTTTTCTCGGAAAGTTTTCACTTCACTATATGC 
TACTTGACACTTGCTTTCCTAAGACATCCCTCTATTTTTGAGATGACTAA 
CTCAGCAAT7CATTTCTCTCACGCATAAGCTGTCACTCAACCCAAACCCA 
CCAAGCCTGCATTCTACCCTCAATAAGGTCTTGGTGTGTAAACTGACCCA 
CTTCACCTAGTTCCTTAGCCCTCTCTTGACCAGACATGACTCTTTCATAA 
3CTAGACCTATAAAGTCAGGGCTCTTAAGTAGCTGATCTCTGATAGTGCC 
AA&TGTCCCCCACTGTTCACATTTTCCACTCCAGCTTCTAACAGGTGATA 
GACTGCTTTTTGGGGGTAGGGGCACCAAAACATATAGACCTCATGTTTGG 
ATGTAGACACTCCAGTTTCTTTAAATTACAACTACATATTAATAATGACT 
TCCAAGTGTACATTTCAGTCCAGATCTCTCCCTGGATCCCCAAACTTTGT 
AAAACCCACCGCCTAGT7GATATCTTTTGA7GTCTGACAGGCA7TTCAAA 
TTTAATACTGTCACAAACAAAGTTATTGATTTTCATCTCTGCATCTGTTA 
CAAATTTTTCTTACTTTGGTAAATAGCACCCCAGGCTGTGTCACTGCCAA 
GAACTTTCCACAGCTCTTGGAATAAAATTCAAAATATTTTCCAAGGCAGA 
AAGGCACAGTGTAATCTGGCTCCTGCCTACCTCTCCAACCTCGTATCACA 
CTAG7CTCCCTGTCACTCACCCCCTCCAGGAGCTCAGGTATCCTTAAAGT 
TTCTTTTCTTTTTTTTTTTTTTTTTTTTTTTGAAACAGTTTTGCTCTGTT 
GCCCAGGCTGGAGTGAAGTGGCATGATCTCAGGTCACTGCAACCTCCGCC 
TCCTGGG77CAAGTGATTCTTGTGCC7CAGCCTCCCAAGTAGCTGCAATT 
ACAGGCGCG7GCCACCACACCCGGC7AA77777G7A77777AG7AGAGA7 
GGGG777CACAA7G77GGCTAAACCGG7C7CAAAC7CC7GACC7CAAG7G 
A7C7GACCAC77CAGCC7CCCAAGG7GC7GGGA77ACAGGCG7GAACCA7 
TG7ACCC7GCC7CC77GAAGT77C77GA7CCAGAC7CA77CC7GCC77AA 
GG7C77GCA7C77CAG7CC7CCCC7CAAA7GACACC7CCA7GAAGACGCA 
A77ACC7G7AA77ACCG7GTC77ATT7AG7CAA7G7GT7GG7777C7G7C 
7CC7CCAC7ACAG7G7AAGCTC7A7GAAGGCAGAAACC7TGGCAG7CCAG 
77CCCAGCACAG7GCC7AGCACACA7AGG7A777AA7AACACACAG7AAA 
A7TCACC7777AG7G7GCAA77C7GAG7777GACAAA7GCA7CAAG7CA7 
77AAG7C7GAC7A77A7CAAGC7A7AAGA7GG77GCAACAC7ATCACTAA 
7TCCC7CA7GC7CC77GGTAG7CAG7C7CACCCC7AACGCCCCCC7CC7G 
GCAA7CAC7GA7CCG7777T7G7C777A7AG7777GGT7777CCAGAA7G 
CCAA7AAC7AAG7T77GAATGAA7GAA7GC7A77AACTC7CATT7C7GAC 
7CCAGAGCAAC^7CCA7GC^ATA777A77A777CAGCCCCAAA7AC7GCC 
CCC7CACC77CAC7CCAACGACC7AC77GA7GA7ACAAGG7GAGACAT7T 
GGCA7G7GC77CC7CCA7GTTCCTAGCA7777CCC7ATC7CC77AGCC7T 
CC77C7AA7CA7AAACGAAGAG7GAAC777CCC777CTAAAGGCAAC77A 
C7CC7AGGACC7CGA7GCCA7AA7777G777C7C7AG7AC77TC7A7A7A 
7ACACCAAACAA77AGC7CCAGAAAGG7AAAGACTCAC7GTG7GC7CA7C 
AC7G7G7C7CC7AGCGCC7GGCACAC7GCAGG7GC7GAAGAAACACC7AC 
AGAA7GAG7GAA7GAA7C7CTCCC7C7C7AGAC7CC7TCTC7777G7AA7 
CAAACA7G77CAACC7GCAACACAG7C77A7GACCAATCC7C7G77G7CT 
GACC7AGGC7GAGCTCCAGGGC7GGGACCC7GACT7CC7TATTCACCACC 
7CAAGG7C7C7GCAC7CAC77C7C777C7GC7CAGGA77G77777C77C7 
7GTCACCAG7C77TTCTCAGACTTAGG7C7CAGC7CAGACATTGCTGTTG 
AAAG7AC77C7AC7GA7CCTTTTA7CTAAAGCAGCCATTCCAGCCC7ACT 
C7C77GA7CA7AGCACCCTGAATTAAG77G777ACTTAC7GTC7C7TCAG 
GAGGGCAAGGAGC77GG7GGTGGTG77CAGGGC7G7ACCAAGC7G7ACCT 
7GC77CACCC7GC7ACAC7TT7TAGCAACCA7C7AAT77TACA7GC7CCC 
T7CAC7CG7CAGAAA777CC77A7777C7AC77CAAGCAGG7A7ACA7A7 
G7GC77C7CC7GGGAGGC7CACCCAC77CA7GAGAC7ACA777GG7CC7G 
GG7AGAAAG7G7ACAAAA7CCAC7GGC7CAG7777AATCAA7G7A7G77A 
A7A77AACCAACC7GAGATCT7GA777CCACGCC7GGC7AA7777G7A77 
777AG7AAAAACAGGG7T7C7CCA7G77GG7CAGGC7GG7C7CGAAC7CC 
CGACCTCAGGTGATCCGCTCACCTCGGCCTCCCAAAGTGCTGGGACTACA 
GGCA7GAGCCAGCG7GCCCGGCCTAAGA7C77GA7TTC7ACCATCTGAAC 
7C7G7A777GAAC7GAC7GC7CC7GC77GAGC77ACTGGCCAAAAC7TGG 
CCCAC7CAGAC7CACGGAAG7T7C7GG77C77CCC7GG7AAC7777C7GA 
AC77AACCAC7GG777GC7TGACAAGAGA77ACCA7CT7C7CAC77CC7A 
3C7A7G7GAAC7CAC77A7C7GC7C7A77GC7G77CAG7C7AGCACGGCA 
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CTTATTGAACGAGTGTCTACATCTGCACCCCCTACTTCTTACTCATCCAT 
TCTGTTTCAATTTCTTAAAAAGAAAAAAAAAAAGCTATTGTAAACATACG 
ATTA C AG AAAATGATTTATAACATGTGTATGT AC CAC CTAGC C CTGTCAA 

GTCTTAATATTTGTTATATTTGCTTCAAATCTTTTTTCAGACTGTAGTTA 
AAAATTACTTAGGAGCCATTATTTATGGCCTATTTCCTGACCTAGTCTTC 
TTGATGG7CAATTTGCCTAATCATCTTAAGTTGCAAAAGCTTAGAATTAA 
AGgAAAGTACCTTCGATCCTCTGCTGTTGCCTTCTTTTTAATATTTGGGT 
TTGTrTGGGTCCCATTTACGGTTGTGACATCAGCTTGAGTTTTGGGAGCT 
GTCTTGTTCAGAAAATGGTTCTGGGGAACAGCCTTTTTCAACTTGGAGTC 
CAAAG7CTGTGCTTTTTGCTGAAAGCCATTATTGTTATGTTTATTACCAC 
TGGTTCCATTTGGTCTTATGCTAGGGGTGCTTGGAATGGCTGAATTAAAT 
CTGCCAACTG7CAAATTAGGCCTCTGGCTTACGGCTTTTGACTTTTGCAG 
TACACATGATGTCTGAGGTATACAAACTTGGCTGGACTTCTGATCTTGCT 
TGATGTT7GGATGTCTGTTGTTATATTCACCCTGAAGCAAACTGGGGTAT 
GTTCTGGGTTTGGTGTGCTTCACTCTCTGTTCAG7AACAGGGTATGACCG 
7A7C77AG7T7CATTTGGTCTTTCA7ATTGACTCC7ATTAACCTTTATAT 
C777GA7G77C77GAC7ACTGGT7TC7T7GATGACTGAACTTTACTAAGG 
GTCCGAATAAAG7GAGAGGGAACCG7CC77GAGGG7777ACTCC7GGTCT 
7GCAAGA7C7GC7CCTCTAGAGAG77GC7GTGA77T7AC7GGGAAAGTCC 
TGC777G7G777C7CCAACAAA77G777A77AACCC7A7C777CAGAACA 
GCAC7A77AAC7GAAC7777GCCCAAGGC77G777AGGAAC7AAAC7G77 
CT7GG7T7GA77A7AAGAGTCAG7C77TGGC77ACT7C7GG7A7A7AA77 
7AGGA7C7GGCT7CCTCTCAGG77C7G77AAGA7A7C7AGCAAG77CTC7 
77G777G777C7T77AGAAAG77ATCCAAAGAT7CG7777CAACATGGA7 
A77A77CA7AAAG7C7A7ACA777ACCA777CC77GA7C7G77AAC7GC7 
GCTT7GTAGT7TTCAATTGCTCTATATTAAG7GACCCCACAGG77T7CT7 
GACAG7CC7CCTG7GGTGGACTATC7AGC77CACAC7G77GAAAAC7CT7 
GCTGAAAAGC7TAGACTATGGGT7AGAAGAAACACA7TTTGAAG7CCGCC 
TTTTGCCCAGAAGTTTTGGTGGCTCTAACTTCAGC7TCTGGGACCC7GCA 
GTATTAGG7GGTCTGGGCTGGAGTT7AATGCTGA7GGACCTTT7AGGTTT 
GACAGGCAAAACAACATGGTTGGTAACATCATT7T7GGGTCTAA7AGTCT 
GAAAAAACAAAGAAAA7ACA7A77AAAAAATCC77AACA7A7C77A77G7 
7TTTAAAA7AATAACTG7GTTTAACACA7GC7AAAAAAAAAA7CA777T7 
AGAA777CA7CTAAGAAAG7TGAA7CC7CAGAAAG7AAAGAAAGAC7CAC 
7AATAGG7AG7T777GTGTTTTTTT7777T77T777TTTGAGACAGGATC 
7TGC7CTG7CACCCAGTC7GGTG7GCAG7GA7GCAA7C77GGC7CA77GC 
AACC7CTGCC7CC7GGGTTGAAGCAAT7C7CCCACCCCAACC7CGCAAG7 
GGC7GGAC7ACAGGCGCA7GTCACTACACC7GGC7AC7TTTTTG7A7TT7 
7AG7AAAG77GGGG77TCACCATAT7GGCCAGG77GG7C77GAAATCC7G 
ACCTCCAGTGATCCACGCACCTTGGCCTCCCAAAG7GC7GGGATAACAGG 
TA7GAGCCACCACACCTG7CCTAACAGG7AG77777ACAAC77GAG77CC 
7ATCAGAAG7ATAT7AGAATCTTTTAGCTTGACAGAATTAAGCAGAGATG 
CAGTGAA7A7ACAAAACTTGCTCTT7CAAAAA7GAATT7GCCTCAAACAG 
7AG77G77GAA7GCC7A77A7A7CC7AAG7GCCC7CCAAAGAACCC7GAA 
AAAA7ACATACATAATGAACTTATGTTAGGG1ACCTCCCAACAAATCTCT 
CCTAG7AC7TTGTA7AGCCACACTA7A7GTTT77TAAACCACTGCCTTTG 
TAAACA7CACAGTA7CACTCAAGAACCTCTGTC7CATCCCTGGAGATCAG 
7GACAAGGAGATAGGTGGCAGATGATGTGAGGCCTGAGATATGCTGCCAC 
AGC7C7CAATAAACA7GTAACATCTTAATAGTCATATTTGTAAAATCAGC 
CAGGACAGGG7777AAGG77AGAG7C7A7G77AA7AA7AAACAAA7G777 
AGTCA7G7GATTTAAGTTTGGATAAGAAAGG7AGGACTCGATTACAGAGA 
A7T7TGAAAAC7AGGGAAGGGAG777AGAAT7CATA7GGTAAGTAA7TGG 
GCAAGCCAC7A7GAA7TCCTGAGCA7C7C7CA7GAAAGCAATTACTCAGA 
AAGGAGAA777CACAGAGA777A7GGAA7A7G777CCAGGG7AAGA7A7G 
GGAA7GC7AGAG7TACCACTCTATTTT7GATT7GACAAATATTGTGAAGA 
ATCAC7ACA7AAACTTGGCGAGTATGTAAAGGA7T7CTAACCAGAACCAT 
T7GGCA77GAGGGCAAAGAAATG7C7AC7C7GGA7GA7AGCGG7G7G7G7 
GG7G77ACTAGGAG7GAAACAGCGGAG77GGGAG7GGGAGGCAGAGAGA7 
GGA7GG7A7ACCCACAA7GGC7A7A7C7GGAT7AA7C777GAGCACCAAC 
A777A7A7ACACCTCGGATC7C7CCA7CA77GC77AC7GAAGAGG7GGAG 
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GGACGTTGGCATGAAAGLiTCO\AATGTGTTTTTTTAGTTGCTi¥cr'TAi- 
ATATTAAAAACGAATTGATATAATCCACAAACCATAAAATTCACCATT r, ' T " 
AGTAAGTGCACACTTCTGTGGATTTTAGTATAGCCACACTATTATACAGC 

AATCACCACTGTCTAATTCCAGAACATATTCATCACCCCTAGAAAGAGAC 

TTGGGTTTACTTGTTGGCAGTCCCTCCCCA 

>Concig3 8 

GGTCTACATGTGCTCGCAAGATTGGATATTGAAATATCAGCAAGAAATTA 

AATGACATAGTAGTCATTATGCCTAAATTATTGTTATTTTTTGATTGAAA 

AAAGTTGAATATTTCAAATATCAAGGTAGTAGTGAGATATAATAAAGAGA 

GAGTCAGTTCTAAGTATAGAATTGCTGATTCAGTTAAGCTCTGTTCTCCA 

ACATTTGGGCCACATTGAAGAGACCATGTAGC7GCTTTCAGCCTCGGTTT 

CCTCCTTTGCAAAATGGGGATTACACTACCTGCCTCACAGAGATGTAAAC 

TTATGACATGTTATCATGATTGCCAGGGCCCACCTGTTT T CTTTTAAACA 

TTGAAATCACTGTGCCTGAAACAGGGATTTCCCTGCCCTTTGTGCAAGCT 

CCAGAAACAGGAGTCAGCCTGAGTCCCGCAGCTAAGAACGTGGATTCTGG 

TCATTTTCTCATAGCGAACACACTTCACAGGTCCTTCAAGGGAGTACATT 

TTCCTATAACTCACCTTAATCTCAGTTGAAGCCTCGTTTCTTATTTTGCA 

CTGTGGCCAAAAACTAAATCTCATTTCTTTCACGTAAACTTCAGCAATTC 

AATAATAGTACAGTCATTTTATGTTTCAACTGAACCAAGTCAGGGTTCCA 

CTCCTGCCTCCCCTTTCTGCTCTGAGGACATCCATGAAGTGGAGGGGGTC 

TATGTAGCCTGGAGCTATTGGTGAGGGGCGATGGGTCCGTGGTGGTCTTG 

GGGAACTGCGGGGCTGTGTCTGGCTGGTCTGGTGTCTGGTGATTGGCCTT 

G7TCCACGCGGTTCACGCTGCAGGACAGTTCGTGTCCTTCTTGTCCAAT 

GATCAGCTTTTAGGCTCACGGGCCTGTCTCTGCTGAGATATGGAATAGGA 

CAGCCTCTGGATCTTCTTTAAACTCTCCTGGGGCCACAGGGGAC7CTGTT 

TGTGTCTGTGCCCACATAGGATGATTCTGCCCAGACCTTTGCTGCCATTT 

CTTGCTGTTCTGCTGTTTTTAGTCTCTGGAGGGCTTGCAGTTTCCTTGGG 

G7CCC7G7GGAAGCAAAGCAAAGTCC7C7CCACGC7CAGA7G7C7AAACG 

TATCTGGGTTTTATCGTCCACCCATCCCAGAGCTCAGTCTAGAGGAGGGG 

GCAGCCTTCGGGTTCTCTCCTTCCTCCCAGAGCCTCTTCCTTTGCACCAG 

GGCAGCCTCTTCCTATCTGTTGGAAAGGGCTGTCTGGTTCTTGAATATAG 

AGTTGCAGGTTTGAGGGGTGTAGGCTGAGGTAAGGCAAACTATCACATGG 

AATAAAAATTACCCTGTGTCAAGGAACAACCAGAGCTGGACAGTTTTTAA 

ATGTGAAAACCAATTTTATTCAGGACTATGGCGAGAGGTGAAGTAAGACC 

TCAGTATAGAACTGGGCTCAATTCCGAATGCAGCATGGGCAAATGGGAAT 

GTATAGCCTAGGAGCAGGGTGGGAACCTGTGGATGAAGAATTACTAAAAG 

GGCA7ATCAGGGGTGAGGGGGCGTCCTGGCTACACCCACTAACTACTGTT 

GC73AAGAAAGGCCTGGTGACATCACTGGGGAA7GG7GGGGGA7GAAGAA 

rCCAATCAGATGGATATTGAGGATAAGGGGATCTTGATAAACTGGCTTAG 

3AGGG7TTT7GC7AAAAC7GG7TTTCA7AGG7AAG7CCACAGACAGG7C7 

TGGAGAAAGT7CAGGGACCTACGGTTTGTTCGGGCAGATGCTTTGTCATC 

7GTCACACTGGCACTGTCACCTGGCTTTCCTTTAGTCCCTCCCCCCCTTT 

TTTTTTTCTGGAGTAGTTTTGGGAGACCAGAGGAGCAGGGAGTTAGGGAG 

AG7AG7CAGAAAAGGCCAGAGAAAA7AAGGAGG7G7C7G7AGGGAAAA7C 

CTTAAATCCTCTAATTAAATTAATTTAATTTATTTATCTGGGACAAGGTC 

TCACTCTGTTGCGCAGGCTGAAGTGCAGTGGTGTGATCTCGGCTCACTGC 

AGCCTCGACCTCAGGGCTCAAGCAGTTTTGCCACCTCAGCCTCCTGAGTA 

GCTGGGGCTCACAGGTGTGCACTACCATGCCCGGGTAATTTTTGGGTTTT 

TTTTTTTTl'TTT lT lTTTTT'rTTTTI T rGTAGAGATGAGGTTTCGCCATG 

TTGCCCAGGCTTGGTCTCGAACTCCTAAGTGATCCATCCACGTCGACCTC 

CCAAAGTGCTGAGATTACAGGCATGAGCCACTGTGCCCGGCCTAAATTCT 

CCAATTTTTAAATGCTTCCCTGTTCCCTGTTCCAGATTTGGGATATTGAC 

TGCTGTTAAATCAGCGATTTCTCCCTGTGGAGAGGTAGCCAATAGGAAGC 

AACAAGAGTGAGGAGTCCTTATATCGAAATAGAGGGTAAGAGAAGAGACA 

GATGTTATC7TGGCAGTGATTTAAGAACAGCGAGTCTGTAAGCAAAGCAA 

AGCAAGGCTCCCAGGTGCTGAGAAACAATGGCTTTCTGGGGAAGCGTCTG 

TGTTCAGAACCTTAAGTTGGAAACATCTCTGAAGATGTTTGCCATGAAGG 

TTTTCTTCTGAAGTTGAGTCTTTCATCACTAGGTAGGCGTGTTTTGGAGT 

CTCTATCAAACAGATCCTGTGTTTATTAGGAAGCTGTGGTTCATAAAGCC 

2CATGCTAATTTTGCAGG7AGCAGGGTGGCCCTGGCCTGACCCGGGGACA 
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3AG7GGC?Q7CZ7£CC^ •CAGGCAGGAAACTCTCTCCTGCCACfCT AGT^ 
CTGCATACCCACATTTCAAGGGAGCTTCTGGGTGGTGAGTTTACCAGACT 
ATGGTCTGAGGTAGAG7TAAGCAAAACAAAACTAAACTGCATAAAGAAAC 
AGAAAGAAAATCAGGTGTTATAAAAACAATTTGGCATTTGTTTGTGTTTC 
AGCTCCG7GTCGATTTATTGCTTCCACAAATAGTGCCGATATGCACCAGG 
CACTGTTGTAAAACTGAAAATATGTTTTTGGATGTGCCCAGTCTGTGAGT 
ATTAAACGATGGTTGATTTGAAATTTGCTATGATTCATATTTCTGGGGGT 
ASGATGCAGGATTTCTTTGGGGGGCCTACGATGTGGCATTCTAGAATTCT 
CAAAGAATCAACCCTGGTGGGACCAGGAAGAGCTGAGCTGAGGCC7CTCT 
GCTCATGTGTACTTACTGGAGATCATGGAGACAGGTGAGCCTGAGTGCAC 
GTCTCACCAAAGCCACAGCAGAGGGGGAGGAGGCGGAAAGAGAGCTCTCT 
CCATTTCTGAGAAGTTAATGGTAACAATGGCATACATACCTACTTTACAG 
TTGAAATTGGAAACGACAGCATTAAGTGTTTCCAATGAAATTTGGCAATT 
TGGGAGTTTTCTGAGCTGCATTGGATGTGGTTTTGCATGCTGTTAGGATG 
AGCAAGAGATGATGGAGAACATCTTCCTTTTGAGCTTCCTCTTGGACGTG 
GGTCACTCCCACTCATGGAATTAGAAAGCTTAGACCTAGACTTGAATCTC 
ACCTTCTCAAGGTGCTCCCGGGCAAATCACTTAAGATCCATCTTTCTTCTC 
CTCCTGCTCCTTCTCCTCCTTCTGAGTTTTTTTTTTTCTTTCCAAAATTC 
AAATGACACGGTACTGGTAGAAGAAAAGGTCCAAGTCTGCTTTTACAGCT 
CCCCTCATCCCCAAATGTACTCCGACCCCAAGATGACCATGTTATCATTT 
GATTGACATCCTTCTAGTTTCAACTCATTTCTTTGCATGTATATGCACGT 
ACATATACACTATTTTATTTTGCCAGGGGTCACCGTTTAGCTGCATTAAT 
TTCTTATAAAATAATCTATATTTACTTATGGTTTACGTAAAACAACATAC 
ACATGTAAGTGTATAGCTTGATAAGTCTTCACTGTAAACCAAAAATAAAA 
TTCGAAGCCCCCCCAACCGTCTGAATGGACCCCTCTTCTTGGCCAAGAGC 
ATTC CAAAGTTAACCTGAAAAAACTAGTTCAGGTCATGATGGAAGGGAAG 
GTTGGACATGCCCCAGTATACCCTTCTCCCTTTTGGAATTCAGGAAAAGC 
TGACCAGCATTAACATCAACACAGACCTTATGTCTGATAGGAAACTTTGA 
CAATCTATTCCCTCTGAAGCTTGCTACCCGGAGGCTTCATCTACAAGATA 
AAACCTTGGTCTCCACAACCGCTTATCATAACCCAGACATTCCTTTCTGT 
TGAGAATAATTTACCTTGTAACCTGGAAGCTCCCTGCTTCAAGTTCCCTC 
ACCTTTCCAGATTGAACCAATGTAAACCTTACATGCATTGATTGATGTAT 
TATGTCTCCCTAAGATGAATAAAAGCAAGCTGTATGTTGACTGCCTTCAG 
CACAGGTTGTCAGGACCTCCTGAGGCTGGGTCACGGATGCATCCTTAACC 
TTGGCAAAATAAACTGTCTAGATTGACTGAGACCTATCTCAGATACTGTT 
GGGTTCAAATATATAACTTATGAAACTAATACACAAATCAAGTCATAGAA 
TATTTCCATCACTCCTCATCTACCCCCAAATTTCCTTATGCGTCTTTGCA 
GTCAACCTCCCACCCCATCCCCAGGCAACTGCAGATCTACTTTTTGTCTC 
rGCACCTTCAACTGACCCTTTCTGTGATTTCATATGAATGGAATCATGCG 
ZTGAGCAGTCTTTTGTGTCTGGCTTCTTTTGCTCAGCATAATGTTTTTGA 
GGTTTGTCCATGTTTTTGTGTTTGTCAATGGTTAATTTCTCTCCATTGCA 
GAG7AGTTTTCTATTGTACATGTGTACCACAATTTGTATATCCATTCCAT 
TGCTGATGGACATTTGATTTGTTTCCAGATTTTGGCAATTATGAATAGAG 
CTACCATGAACACCCAGGTACAAGTCTTTGTGTGGACTTATGTTTTCATT 
TCTCTTGGAATGGAACTGTCATATCAATAAGTATATGTTTAACTTTGTAA 
GAAACTGACAACAAATTATCTGCGATGGTTATGCCATTTTGTTTTTCTAC 
CAGCAATACACGAGCATTTCAGTTGCTCCACAACTTTGCCAAAACTTGTT 
TTCTTTAATTTGGACATTTAAGTGGTGTACAGAGGCATCTCATTGTGGTT 
CTAGTTTTCTTTGCCCTGATGACCAATGGTGTTGAACATCTTTTCATGTG 
CTTTTTGACC^TTTACATATCCTCTTTTGTGAAGTGTCTGTTCAAATATT 
TTTGC C CATTTAAAACATTTGGGGGT TTGTCTTATT ATTGTGTTGGGAGA 

GTTCCATATTTATTTATTTATTGAGATGGAGTCTCACTCTGTTGCCCAGG 
CTAGAGTGCAGTGGCGTGATCTTGGCTCACTGCAACCTCCACTTCCTGGG 
TTCAAGCAATTCTCCTGCCTTAGCCTCCTGAGTAGCTGGGATTACAGGCA 
TGTGCCACCACACTGGCTAAGTTTTTGTATTTTTAGTAGAGATGGGGTTT 
CATCATGTTGGCCAGACTGGTCGCAAATTCCTGACCTCAAGCAATCCACC 
TGCCTCGGCCCTACAAAGTGCTGGGATTACAAGCATGAGCCACTGTGCCT 
GGCCCATATTTATTTTTTATTCTTTATTTTGTATACAAG7TCTTGGTCAG 
ATACAATAATACCTGGTCAGATGAGATAATGAGTTGGAAAATGCTTTGCA 
AATGGGGGAGAATAATTTAAATGTTATTTATTTATTAAGAGCAGAGGCCC 
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TTCCTGTTGCGGTCAC.^AAGCCGTTTGCTTCTTCTGCCTTTTAfiOU^J 
AGCAGAG7CGAGC7ACACAGGC7G7C7G7G77GGC7GC7A77AG77AA7C 
AGAGAGTTTTTTTTTTCTTGCCTTGTCATTCTAATTTGTGACACATAAT7 
AGCCLACAATATGTGTTTTCAGTTGTGACACTGGCC7GGGAAACCAAGGGA 
TGTTTAGAGTGGATTTCCTTGATTTTGCAATAATTGTGTGTTTTTCTGCA 
TC77C7G77AAACACAAA77CA7GGAAGCAAAACA7GGAAGCAAAG7ACC 
ZTGGACATCCCCCCTTCTTTATGAAATTGATTTCTCTTAAATGTAATGTT 
TGC77G77CCC77AC777AAAAGCAA777AAGAG777A77GAGAAAG7GA 
GCCC7GGAAACA7AGA7GCA7AGAGAGAAAA77C7ACCACCC7CAGG7CC 
C7A77G7C77C7C7CA7AAAG7G7AG777CAGGGCC7777AGAAG777C7 
TTTCTGCTCTGATTTGCATGTTTGTGAGTGTTGCTATTTTAAGTATTTGG 
ATTTGGTCTGCAAATCCTATGAGAGATGGCAACAGAGTAGGGATCTCAAA 
GC C7G CAGGTTGTATTAAGTC CAGCAGGG C CTTGTATTTACAACAG AGGG 
TCCTTGAAGACATTCCATATATTATGCTAGGGGAGTGGCCAAGCAAACTT 
TAATGTGTCCCTATGGTGGGATATTTGGGGTTAATACCTGCCCTTCTCTT 
AATTTCTTTTTCTTTTCTTTTTTTCTTTTTCTTTCTTTTTTTTTTTGAAA 
TGTAGTCTTGCTTTGTCACCCANGCTGGATTGGAGTGCAGTGGTATGATC 
TCAGCTCACTGCAACCTCCACCTCCTGGGTTCAAGCAATTCTCCTGCCTC 
AGCCTCCCAAGTAGCTGGGACTATAGGCACACACCACCATGCCTGGCTAG 
TTTTTTTTTTTTTTTTTGAAACNGAATCTCGCTCTGTCGCCCAGGCGGGA 

CTGCGGAC7GCAGTGGCGCAATCTCGG 
>Concig3 9 

CGC7CGCA7CCC7CA7A7CCA7GAG7G77C7G7GGGCCC7GCC7C7GAAA 
TAAA7CG7GCC777G7C7CCCAG77CAC7CCAGCCACCCA7CC7GGGGC7 
GCACCCTCCTCCTTCCAAGCCCTCTCCCTTTCCTTCCTGG7GC7GCCTGT 
CATG7CAAGCATATGCATCAGTGCGACCAGGACATTTGAAATGCAACCAG 
7ACAA77GGGCGCGG77A7GCC7ACCAG77T77C77CC77AAACA7777A 
7A777A7G777GAAAGCA7GCCACC777C77CAC77GCCAAC77GACAGA 
777A77AG77GACAACA7CCGC7GA7AGCA7CAG7AA7AAG77AA77G77 
7T7GCACA7G7AGC77TAA7TA77C7CA77A7CA777A7AGGAG77A77C 
7TTG7AAAGGG7AAC7GAG7777CCAAAACAAACAGAAA777GGGG7GGG 
CCCA7GGAGCG7GAC7CA7GAAA7CAGA77C77AGAAGGACC7CGGCAAG 
7C 7C7 GGG77GC7G77AA7GAGCC7GGC7GGC7GCCAGGGG7G7G7C7GC 
CC777A7GAGGCCACCAC7G77CAAA7GC77GCC7GCAGCA77AC77GCC 
7AGG7AG7GC77G777C7AC7GAAC7G7CAGGGA7CCAA77C777G7GG7 
C7AAG7AACAA7AC7CAGA77CACAAGGAA77GA77AA7AAGCCAGAA7G 
CCAA7G7A77ACA77777GA7GAAGACCA7A777ACAG7GA77G7A7C7G 
C7CAAGC7CAAA77AGGA77AGAG77C7GACAAA7ACA7A7G7GAGAAG7 
A7GAGG7TAAA7AC77GAAA7T7GGAC7777C7AGAAAA7C7GAA7G7GA 
77GCCA77CACA7ACC777C7GGGGA7GA7GA77C77G7AC7777A7777 
AAAAGACA7AGAAAAC7AAC77AAGAA7CAGA77GC77GGC7GGGCACAG 
7GGC7CA7GCC7G7AA7GCCAGCAC777GGGAGGCCAAGG7GAG7GGA77 
GC77GAGC7CAGGAG7TTGAGA7CAGCC7GGGCAACA7GG7GAAA7CCCA 
TC7C7ACCAAAAA7ACAAAAAAAAAAAAAAAAACAACCAAAAAGAA7AAA 
77AGC7AGG7G7GA7GG7GCG7GC77G7AG77CCAGC7AC77GGGAGGA7 
GAGG7GGAAGAA77GC77GAGCCCAGGAGG7GGAGG777CAG7GAGC7GG 
GG77GCAACAG7G7AC7CCAGCC7GGGCGA7AGAG7GAGAC7CCG7C7CA 
AAAAAAAAAAAA7CAGA77GC777A77GC7GG7777C777C7AAAAC7GA 
GA77GGG7CCCA7CA7CCCC7GGCCCCCA77GG77AA7GG77CC7CC777 
G7C7A77GAA7AAAA7ACAGA7G7C7GC7777GGCAACA7GG77GAA7G7 
AGACAC7GCAGGG7C7TCC7GAC7CAAAA7GAG7AAGGC77AGA7AAAAC 
ACA7777GAAA7GCA777C7GGA7GAACAGCAAGGAAAGGAGA7C7C77A 
AAA7CC7G777C7G77CCCC7C7CCC7ACCCCC7CCAAG7GGGC77AAG7 
AGGAAGGG7GG7GAGCGGCAGG7AAACACACG7CAAAGGCAG7C77CC7C 
7C7GAGGGAAAACAC77G7A7AAGCA77GCAA7CAA7GGGCC7C777AA7 
7A7G7GCGAG7GGCAAGAGCGGG7GC7GAACCCAGGGGCC7GCC7CAA7C 
CGGGGCC777GAGGCAGAA7AAAG7GG7C7CAGG77G77GGCA777CC77 
GCCC77CCACCCGAAGCAGACACAAA7CC7C7C7GGAGGCAAG77CCCCA 
A77CAGCCAG7ACAAC7CCCACAGAC7AAGA7CAA7CA7G7ACAAGC7CA 
CAGACAAAGG7CACCAAACACACAGAGCAA7AAACAAA77CA7GAG7GAC 
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GTGAATGAGAATAAACAi_*AACAATAACCACCAGCTGGGATGC*d^AAG 

C77CAGC7C-77AGAA77CC7GAA7A7AGAA7AAAAC7GCCACAA7GGCAA 

ACATGCATCTAGTACTTACTGTGTGCTGGGTTCTAAGAATTTTGCACAT*" 

GTGCCAGATACCGACTCAGCTTCACACTCACCCTCCTACTGTGCCC^CT^ 

AATTTGCACTAGATTAAAAGGTAGAAAGGAAGAGGCAGCTATTCTGTTC" 

TGGCTGTGCCTCTGGCAGCACATGCAAAATGGGCAGTAACAGTGGCAGTC 

ACAGGTAAGTAGCCTTCTCACAGTGTGGAGTTAAAGGCATGGGACTGAGA 

CGAGCAAGGTTCCTAAAGGGACAGTGGCCAGTAGATGACCAGGGGCTACT 

GGAG7GGC7GCA7GGC7C7G7GGAAGC7CAGAGGAGCC77GGG7CC7GCA 

GGTGCAGTAGCAGCTTTCTGTAGTTCCTGATCTCTGGGTCCCACAATCTT 

CCCCGTTTTTGCTCCTCCACT7CTAATTTTGTAACTGACTTCCCTG7GTG 

TAC77C7C7C7C7GA77GAAA7AGCCAGAC7GG777C7G777CC7GA7AA 

GACATTGTCTGGTACGAACACAGTAACTCATTTAATCCGATATCTCTATG 

AAGGAGG7ACAATAATTATTCCTATTTTACAGATGAGGAAACACAGCAGA 

GAAA7AAAGTCAA77G7C7AAGG77GCACA777AG7CAAGGGAAGGG77G 

A7A7AACATA7AA77A777AGAAAACA7C7AAGGAAATAAAAGGCA7AA7 

TTAAAAATAAAACTAGGCAGG7T7AAAAAAATGAAGTAATC7ATAAGTAA 

AAAAG7A7AA77GTTGAAA7ACA7ATC7TAG7GGA7GGG7TAAA7AGC7G 

AAGAAA7GA77AA7GAAC7GGAAGG7AG77C7GAGGAAA7CAGAA77CAG 

CA7AGA7AGAAAAAA7GGGAA777ACAAAAG7ACACAGGAA77A7AAAAG 

AGG7TAAATTATAGGGAGGG7AGAATGAGAA7TAACATTGGTCTAAC7GG 

AA7777GGAAGAAGAGAATAGAGAGAA7GAACAAGGCAA7A777AAAGAG 

37GGCTGAGAAT7777CAGAACCAACACAAACTA7GAC777ACCAG7AGA 

GAAAACAATG7ACAC7GAGGAGGA7AAA7AAATA7ACTA7GAACAAA77G 

TAA7AATAATAC7CAACAAAGACAAAGAGAAGA7GTTAAAA7CAGCAAAA 

AAAGAAAG7CAGACTTAGAAAGAAATGACAA7GGCAGAC7AC7CAACAAC 

AACAA7GGAA7CCAAA77CGG7CAAACAG7A7777C77CA7GC7AGCATA 
TAGC 

>Contig40 

GGGAG7CCGC7A7GC7CC7AAAGA7T7GCACC7C7GATC7GG777GTAG7 
7AGTC7C77T7A77GC7TTATCC7AC7CAAC7AAT77777TAG7GCCTGT 
7TTT7777T77TTAA7GTG7GT7GATGAC7ACAAT7C7AAAC7CA77C7A 
CTGA77CA7GGG7GCTr7AAAA7C7GAGCAG7C777CGCA777AC7GCC7 
G7GA7GGCCCA7CCCACCAGC7AAAGTG7G7GGCCAC7GC77ACAGCACC 
A7G7GA7AACGAG7AAGGGAGAGA7GCCGCCCAGAC7C77C7AGGAGCAG 
CCAG7AGGACC77CCAGGGG77GCAAGCAAACCACAGCAA7A7G7GGAG7 
G7GGCAGAGGA7GGCCCCAAGAGGA7G7GGCAGCGGC7AG7GCAGC7CAG 
CT7AG7C7GAGAGGAAA7GC7GGAGAGGAGAGCCCAG7C7G7ACAGGCA7 
GACAGCCACAAGGAC77CAACAGC7AACA7GGC7GAG7GGAC777A7G7G 
CTA7CTCAT7CAGAAAACAGGAGCAA7CAGAAAGGAG7CACC7CC7A777 
G7ACCCCAGGAA77GC7AACC7AC77GCA7C7GAA7GA7G7CCA7CAC77 
CCC77CA7CACC7CCTC7GGGGGCTC7GCAAGGAT77GAC7CC7GCATTA 
G7GA7C7G7C7CACC7ACG77G7GA77CACA7GAAC77AC7AA7G7GC7A 
TG7GACAAC7ACCA7C77AAACACAAAAACCC7C7777GA77C7G7GGC7 
CCC7CCAGC7ACCCCTGCATTTCTC7G7CCCCC7GCCCCGTCTC7GCAC7 
CACTT77A777TACAGCAAAACTAC7CAAGGGAG7CTCAG7GCTCCT7GG 
C7CCA7G7C7CCACC777CA77C7C7CC7CAG77CAC7CC7G7CAGGC77 
CCGTCC7CAAGCTCTTCTTCACT7TTG77C7AGGGCCGC7GACA7CCTC7 
T7C77GCCAAA77CAG7GGCCAGG7CC7CAC77AC7CAAC7GC7CAGCA7 
7G77GGGCC7GG7GGACCACA77C7CC77CACCCACC7777GC7GC7C7C 
7CTTC7C7CCAGATGTTTCTC7CTTCTCAC7GGCTAC7CCTC77TTGTCT 
CCTTTG77AGC7CCA777CTTCCTTCCAACC7CACTG7GC7GG7GTGCCC 
AG7GC7CAG77777AGC7A77C7C7C7777CCAG7GGCA77CA77AGA7G 
GTA7CA7G7GACCCATGGCATTATA7GCC7TC7ACA7GACAGT7ACTCCT 
GAA7A7GAA7C7CAGGAAAGA777GGA777A77777AA77AA7777777A 
AA7777A7777AA7AAA7GAGG7C7C7C7C7G7CATCCAGGC7GGAG7G7 
AGTA77GAG7GATG7GA7TA7AGC7CAC7GCAGCCT7GAACCA7GGGCTC 
AAG7GA7CC7CC7GCC7CAGC77CC7GAG7AGC7GGGAC7ACAGGCA7G7 
GCCACCA7GCC7GGA7GAC77777G7G7G7G7G7G7G7G7G7GGAGACAG 
GG7C773C7CTA77GCCCAGGC7GATCACAAAC7CC7GGCC7CAAG7GA7 
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CCTCTCACCTCAGCC7L .CAAAG7GC7GGGA77ACAGG7G7GAG&C(^ J ; * 

CTGGGCTAAGATTCAGATTTTGTATTCAATTGACTGTTTGACATCTTCAC 

T7GGACACC7AAGAGG7A7C7CAAA7A77AA77AAC77GGCCAAAA7ACA 

GAACTTTTGACCCCTGCCCCCACAATACTTGCCCCTTCCCCAGAC7TCTC 

CA777C7G77AAA7A7CCCCAG77AC7CAACCC7CAAACC7A7GAA7GCC 

C777GA777C777C777CCC7CA7C7CC7ACG77GACGCCA7CAGC7AG7 

777G77GCC777A7GCCCAGAA7A7AA7CC7CACCACC77C7C7CC7A77 

GCCCGAG7A7AAGA7G7CAG77777CC7GCACAG7CCA77GCCC7GACCT 

CCTGAGTGGTTTGCTTCCACTTTTGACATTTGTATTCCTCTTTCCCCCAG 

GG7CAA77777CACAGCAAGAG7GGCA7777777777777777777777G 

AGACGGAGTCTCGCTCTGTCGCCCAGGCCGGACTGCGGACTGCAGTGGCG 

CAATCTCGGCTCACTGCAAGCTCCGCCTCCCGGGTTCACGCCAT7CTCCT 

GCCTCAGCCTCCCGAGTAGCTGGGAATACAGGCGCCCGCCACCGCGCCCG 

GCTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTCACCTTGTTAGCCAG 

GATGGTCTCGATCTCCTGACCTCATGATCCACCCGCCTCGGCCTCCCAAA 

GTGCTGGGATTACAGGCGTGAGCCACCGCGCCCGGCCAAGAGTGGCATTT 

TTAAAACCATATATTAGATCATTGCTTTTGTGTTTGGGAACCTCCAAGGG 

CTTTGCATCATATATCAAGTTGACACCTCTCCTACCCAAGCCTGGCTCTT 

TCCTGCTCCTCTGTCCTCTCAGCCCCTCCACCCATTGTTCATGCTGCT7C 

AGCCACAC7GGCC77C77GCCA7GCCACA777G7GC7AAGCCCACA7CCA 

A7C7CGGGGCC777GCAC7CGCA777CC7C7GC77GGCA7GC7G7ACCCC 

AGA7C777CA7GA77GGCAGC77C7G7ACA77CAGCCACC7GC7CAAGCC 

ACCC777CAGAGGGCC77CCC7GGCCACC7CACC7GAAA7AGCACC7CCG 

A77GCACCCA7CCGG77A77C7CCA7CC7G77C7C77GC77GG7GA7777 

CCA7CAC7GA7GAGGAAA7GAACCA7GGAA7GC7AGGGC7GA7GACCAGA 

AC777CCCCCACCCCCACA77A77ACAGAGGAGGAAA7GAGG7CGGAGG7 

AAGA7GGGCCCAGGA777C7AC7CCCGCC7GGAC7GCAGGCACAGCAC7G 

ACC7CAGC7G7GC7CAC7C77GGCA77CACCCAACCC77C7A7C7CCAAC 

7GCCCCA777ACCAGAAAG7GAAA7G77C7CAGAGACGG7GAGCCACC7G 

AC77GGACAGCAGCCCAGGGCCCC7GGCACCC7GC777C77CC7CCC7GC 

CA7CC777CC7C7CCAAGACC7ACC777CCC7G7GA77C77GCCCACA7G 

C7GCA777CA7GG7T77ATGACC7GA777C7GAGAGGGA777GAA7777C 

A7GA77A777A7GTAAGCAAA7CA77A7GC77A7ACAAA7GAGAAAAGGA 

G7GC77C7GGAC77CCCAGGGACAAAA7C77G7CAC77GGC77GC777CA 

7A77GC7AA77AAGGACCCAGGA7G7GGG7GAGA7G7GC7AAAAGC7GAG 

AGGAGGC7C7GGAC7C7GAC7A7GGGCCCACACCCC7GGGCAGGCA7CAC 

AC7AG7CC777AGG7CA7CC7CAACCCAGC77CCAG77GAA7CAGA7G77 

7G7GAA7AAC7CAGCAAGGC7G7A7GGGAAA7GAAGAA7GAGG7GGGGAA 

GAGGCC7G7GCAGAAGACACAC7GAC77ACCCC7C7ACC7C7AAC7AGGG 

TG77G7AGCAGCCACCCACCCACCAAG7C7G7C77CCAGACCACG7A7GC 

777CC7CCACC777GCA7C7777A7C77C7GCCAGCCCAGA7GC77GC7G 

AC7CCAGCCCAAGCC7A7AGGA7AAGC7ACAGCC7G7CCC7ACAGAC7AC 

GCA77GCAGAA7C7AAGACA7CAAG7CAAG77CGGAAGCAC77GCC77C7 

CC7C7CCAGG7ACACAGGC7C7CC7GGAAAGC7GG7AGCAGC7G7GGAGG 

7G7GG7G7G77ACC7GC7GCAGG7GCAGAGAAG77GAC77CACAGCCC77 

CAGAAAGAC7GCC77C77CCAG77G7A777G7G7AC77GC77GGG7G7GG 

GGAGGA77C7CAGCTTTCTCCAC7CAAA7TA7CAGACCC777CCA777AG 

7GG7AGACCA777CCC7CG7CCAGGCCAAGGGCACA7AG7ACAGAGAAA7 

AGGGAG77G77ACCCAGGGAGAGAAC77GGC7C7AAACC7G7AATAGAAA 

GG7CAG77C7GG7C7GGAGGG7CAA7777GA7C777GGC7CAGA7CCAGG 

AA77GGAACCAAGGC7777GAACA7777AA7GCAGGGGA77AAAAAAA7G 

A7ACGAG7CA77CACGAA7A7A777GC77AACA7C7AAAGAGATCCC7CA 

AAACAC7AGAAAAAA7AAGAACAAAAA7C7AA7AAAACAAAA777G77AA 

ACACA777ACCAA A 1 T 7777T7777GG7AAAAA77CAAA7G7CA7AAA7A 

AAGC7AAAG77CC7C77GA7GAC7CGC7CC7C7GCCC7A77CCAC7CCAA 

G7AACCAC7A77A7CAG7C77GCCAA7ACCC77CCAGACC7C7C7ACC7C 

7A7A7ACCA77AGAAGCACA7GG7777GCA77GAGGA7G7GCAG7G7777 

G7777ACG7AAA7G77A7CAC7C7G77C77G77CCA7AA777GCC77777 

C7C7CAA7GA777GC77GGC7A7C777C7A777CAG7AGCA7C7CC777C 

T7777AAC77ACCA77G777A777AACC77GCC7C7A7CAACAGA7A7G7 
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AGG77G777C7AG77GA * TTCATTAAGTATTTATAAACAACGbfiTCAG'i W 

3ATG7CCATAAATTTCTTTACGGAAGATGGCAAGTAGTGGAATTGCTGAG 

CCAAAGAACATG7TTAAAAAACCCAAAAAAACTAGACGCTACCAATTT' T, C 

TCTCCAAAATGGCCATACCCACTTACCCATACAGAGATGATTTGGAATC* 

GGC7TCCTCACAAGG7GAGATGCCTTCACAGTT7CATTCTTCCTGGCATG 

TC77CCC7777G7A7C7GAGAGAGC7GGCAGAA77G7G7CAC7AAA7CAA 

GGATAGAGGGTCAAATGACAGC7CAAGC7CACAGGCACC7C7GC7' p ' r C'^ , 

CCCAGACCACC7GC777CC7GCCACCAGC7C7G77CCA7C77A7AGAA7G 

G77GCCAC77GGG7G7C7GC7CCGACAGCCA7G7CA7CC777GCAC7GCA 

G77A7GAAGCAGACAGAGC7AGGAGAGGGGC777GCCAGCCTC7GCCC T A 

GC77GGAGAA777CAAAGAAGGAGGG7A77GAGAG7GAGC7GCCGAAGAC 

TGGCAGCTCCC7CAACTCAACAG77G7CCT7CCACAAGAAG7CAGA7ACA 

T777777GGGA7AAAA7A777AAAAA77A77A77T7A777CTGAA7AATA 

7A777ACATGA77CAAAAA7CAAAC7G7AGGCCAGGCA7GGC7GC77A7G 

CC7G7AA7CC7AGCAA777AGGAGGCCGAGGCGGGAGGA7CAC77CAGCC 

CAGGAG77CAAGACCAGCC7GGG7AACA7AG7GAGACCC7G7A7C7ACAA 

AAATTTAAAAACAAAAAT7AG77GGGCATGG7GGCTGA7A7GG777GGC7 

C7G7GACCCAAC7CAAACC7CA7G77GAA7777AA7CC7CAA7G77GAGG 

GAGGGTCC7GG7GGGAGGTGA77GGATCATGGGGGTGGG77CTCCC7' r GC 

7G77C7CA7GA7AG7GAG7GAG77C7CACAAGACC7GG77A777GAAAG7 

G7G7AGCACC7CCCCC77CAC7C7C7CAC7C7CC7GC7CCGCCATAG7AA 

GA7G7G7G7G777CCCC777GCC77CCGCCA7GA77G7AAG777CC7GAA 

3CC7CCCAGC7A7GC77CC7G7ACAGCC7G7AGAAC7G7GAA7CAG77AG 

ACC7C7777C77CA7AAA77ACCCAG7C7CAGG7CA77C777A7AGCAG7 

G7GAGAG7GGA7GAA7A7AG7GCCATA7G777G7A77CCCAGC7ACCCAG 

GAGGC7GAGG7AAGAGGA77GC77GAGCC7GGGAG777AAGGC7GCAG7G 

AGCCATGAC7G7ACCACTGC7C7CCAGCC7GGG7GACAGCGAGACC77G7 

C7CCAAAAAAAAAAAACCCAAACTGTG7AAAA7G7G77CATAAAAG7G7C 

7TGC7CCCACACC7G7CCC7A7A7A7C77A77CC7CAGCC7CCGACAAC7 

ACT7TAT7CA7TTCTTA7GTA7C7TCCAGAATCAAAAAAAAAAAA7CAAA 

7ACAAGCACAG7GGAA7G7AT7GCCC77C77CCCC7CCC7TTrG7TACA7 

CAGAG77AGCATA7CA7AAA7ACGG7C7GCA7777C77C77777CAGC7A 

7CAGCA7GTT7TGGAGAGGA777CA7A77CG7GCAGACAGCA7GTA77AG 

TCAG7C C77GCA77GC7ATAAGGAAA7AC C7GAGAC7GCA7AA777A7AA 

AGAAAAGAGG7T7AA77GGC7CACAGC77CGCAGGC7G77CCACAGGAAG 
CA7GGCAGCA7C7GC77C7GGGGAGGCC77AGGAAGC7777AC7CA7GCA 
GAAGACAAAGCGGGAG7GGA7G7C77A7A7GGCAGGAGCAGGAC7GAGAG 
AG AG AG AG AG AGAG AG AAAGGATG C CACA7AC7777AAACAAC CAGATCT 

7G7GGGAAC7C7G7CACGAGAACAGCACCAAAGGGA7AG7GC7AAACCA7 
7CA7AAGAAC7CCACCCCCA7GA7CCAA7CACCCCACACCAGGCCCCACC 
7CCAACA7CGGGGA7TACAA777GACA7GAGA7TTGGGC7GGGACACAGA 
ACCAAACAA7ACCAGAG7GC7TTCTCA7TCT7T7C7ATAGC7GCC7AG7A 
77C7A7G7CC77TAC77CA777AGGCAG7C7C77G77GA7AGACAC77GG 
G77AC7TCCAA77777CC7AT7ACAAA7GA7G7GCAA7GAA7AA7777GA 
7CA7777CCA7T7CACA7GGG77A7GTCCA7C7GTGGGA7AAA7CTCCAG 
GAG7GAAA77GC7GGA7CAAAGGGGAAG7GCACT7G7GATTT7CA7AGTT 
AGCAAA7777G7TC7A7AAGGG7CATA7GAA777A7AG7CCCACGCG7AA 
TA777AACAGTGGGGA777CCCGACAG777GACCAACAAGG7CTG77G77 
AAAC7TT7GA77777G7CAA7C7GATGGGAAAATAC7AGTATC7CAAAGT 
GC7777AA777GAC777CTTA77ACAA7G77AAGCA7CA7777AC7C7GC 
CCAAGA7CAAA7AG7A7777C777TCTG7GAACAGAC7G77AAGA7CCC7 
TGCC7CT7G7777GC7GGA7777TGT7C77777777CAAA7G7777GAGG 
CAG77C777ACA7G7GAAACAAG77A7C7CT77A7C7GGGG7G7GAG77A 
CAAC7AC7777CC7C7GGC77G7777GCGC777GAC777GC77C7GG7GA 
T7CCCGCAAT7C7GAAAG7G7ACT7777GCA7CA77CA77C77A7ACACC 
CA7GC7C77G77CACGC7GG77CC7C7ACC7GAGGGC77777C7777C7G 
C77C7A7C7GGGAACA7777777GAGAGAGAG7C7CAC7C7C7CGCCCAG 
GC7GGAGTAG7GCAA7GGCGCGATC77AGC7CAC7GCAACC7CCACC7CC 
7GGG77CAAGCAA77C7CC7GCC7CAGCC7CCCAAG7AGC7GGGA77ACA 
GGAGCCCACCACCAAGCCCAGC7AAT7TG77GAT77A777A777A77777 



FIG. 3 (41 of 52) 

43/// ST 



WO 99/06426 



PCT/US98/16102 



TG7AGAGATGGGAGTC. ^CTATGTTGCCCAGGCTGGTCTTGAAiTCC . J 

GGCTCAAGCGATCCACCCACCTCGGCCACCCAAAGTGCTGGGATTACAGG 

CG7AAGCCACCA7GCCCAGCCCA7G7G7GGAAA7C77C7G777A7CCC77 

7AGGC77GA77C77A7G7CG77C7CC7CCC7CC77CC7GGA7AC7CC7C7 

7G77C777A7C77AC7C7AC77G7CA7G77ACC77G777C7GC77A7AAC 

7AGC7GCC7C7CC7A7C7GAGGAGGGAC77G7GAC7G77C7CA7C7C7G7 

AC7CC CAGCT C C7AG7ACA7AG CG CTTG CT CAACAG A7G77TGGTG CA7T 

GA7AGA7AAA7CAC7GG7AGC7G77AC7ACCAG7CC7GAC7CCC7GCAG7 
GC77CAGC7GA7CC7G77CCAGA7G7GCAC7GAA7A7CC77C7G77GAAC 
AACAGAAA7AAAGGGGA7GGG7GAGGAGGA7AG7C77CGG7GGCCAAGGA 
7A777T7AGG7AC777GCAGCAC7CAGCAA7GAGGAG7GGGC777AG7CC 
CCCAAGAAC7C7CACAGCCC7GGG7G7C777AC7G77CAG7G7CAAA7CC 
AAGACAAG7CAA7GA7CAGGAAAGACCA777777777G77CAG7GAAG77 
7A777CAGAA7CA77GAACAG7A7GA7A777GG7AA777CA7AAA7A77C 
CCAC77AAAA7GA7CGGAGCAGA7A7A7777CAG7CG7AA77AAAGGACA 
TGA777AAAGAGAGCACACCAG7CCAAA77GAAA7GA77CCA7AGC7A77 
AAAAAAC7AGGG777777ACAGACAA7GA7AC77777GCCCCC777GAA7 
AG ATTAG AC CAA7GAA7AAAACAAACAAACAAATAAA7AAA7AAATAGGG 
AAGCGG77GC7CA7CAGAA7G7GGGAGCGAA7GACAGAGGG777C77AGA 
ACCAAA7G7GGCCG7GG777C7G7CAGGCG7GC777AAG7GAG7AGGAGA 
GG7GAGAGAGGCC7GGC7CAACAAAAGGGC7GGGGA77G7CCC7GAAGAA 

CCAGAGC7GAN77NCA7CAGGAG7AACANAGG7AGA7AG 
>Cont:ig41 

CCGCG77GAGG77CCACGCAG77CAAA77A7G7CCAA77A7CAACA77AA 
TGCACA7777CAA7AGAACC7G77CCGGC7777C77AGGAGGGGGGCGGG 
GAGACG77G77C7C7GGGAA7AAG7G7ACGCAGGAGGC7GAGAAGGC77C 
AT7CCA7AGCA77CAC77ACC7CCAGC7G7AGAG7GGGC77A7CA7C777 
CAACACGCAGGACAGG7ACAGA7777777C777GAGGCCCAAGGCCACAG 
G7A7777G7CA77AC777C77C7CC77G7ACAAAGGACA7GGAGAACACC 
AC7GAAGAAAGAAGGGGG7C77G7GG77AGGGACACAGCAG7GCAGGG7C 
ACCCCAACCCC7AGGCCCCA7GAG7AGGA7ACA7G7AA777GG7AGCC7C 
7G7GGGAACCCACAG7GAGG77CC77GGCC7AAGACACAGGA7AAC77GA 
C77C7CACAGACAA7AGCAGGG7CA7777G77GA777AGGG777CCCC7C 
AAAGGCC7GAGGG777C7CAGAGCC7CA7AGCAG7AGGAACGGAGAA7GA 
AAGAGGG7C7ACA7777AAA7GC7GAAGGAAGGAAGGAAGGAAGCCA77G 
7G7CAC7GGC7GGCAA7G7GCCCA7CCACAGGAGCGGAACAAC77GA7CA 
A7G7GGAAGGAAAGGAAAGAGG7GAGGC7G7AC77C7GCCAGAAA7CAGG 
GACCAGAAC7G777CAGGAACAGAGAG7AGCCCA7GGGAAGAAAC7GGGA 
GAGGAGAGGC7GAGC7GGGAAAG7GGC7CCAAAGAGAGACAC7CA7777G 
A7C77CC7CAG7CACAGCAG7G7CAA77GGAGGCCC7GGGA7CAC7C77A 
C7ACCCGA77CCAAAGAAACAGGA7777C77GGCC7GGC7GAGAGCAAA7 
AGC77CCCCC7GAG7GAGGC7G7CC77CAAAG7CAGCAGCC77AG77GCC 
CACAC7CC7G7GCAGAGGC777GGC7AC7G7GGCACGA7GCCAGGCAGA7 
CACCACAGC7AA7GA7GGG77CACCGCAC77GAAAC7777GCCCG77ACA 
GCGGAGAGA7A7AAG77CC7GC7GGGCGG7AAAA777CCC7ACAAGGAAC 
CACC7GGCA77GGG7GGGACGGA7G77GGGGCAAGGGGGGAAGAC7GGGG 
AGGGGGA7GGACACA77A7CGC7CCAGCAC7C77G777CAGCC7CAACAA 
CAGGAAGAGAGAACCCACAGGCAG77AGGCCA7G7CCA7CAAA7GACCCC 
A7A77G7GGAAGAA77GACA77GCAC7A7GCCCAAGAGAC77GGG7GGAC 
A7GG7CC7GGGAG7GC77GAGCCG7C7AA777C7CAGGG7CACAC7CC7G 
T7AAGAAA7GCAC7GGCCAG7GCAA7CAAA7G7GCCA777C7AGGACGAA 
AG777G7A7A77CC77777AA7A77777777CAC77G7G77GA7CA777G 
CC77AAA77AAC777C7AC777G777AAAACA7GGAGAA77AGCAAGC7G 
CCAGGAGGCCAGGCAGGGAAACCAGGA7G777CCA777ACC77G77GC7C 
CA7A7CC7G7CCC7GGAGG7GGAGAGC777CAG77CA7A7GGACCAGACA 
7CACCAAGC777777GC7G7GAG7CCCGGAGCG7GCAG77CAG7GA7CG7 
ACAGG7GCA7CG7GCACA7AAGC77CG77A7CCCA7G7G7CGAAGAAGA7 
AGG77C7GAAA7G7GGAGCACA7G77G777AGG7A7AAAA7CAGAAGGGC 
AGGCC7CG7GAGGCGAGG7GGCAAAA777GA777C77GGAGGACACC7GA 
GCA7A7ACGG7CAAAG7C7GA7GACAACACCAG7AGGGA7GAAGC7GGGA 
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G7GGGG7GGC7AAGAAC. .CTGGACCTGACACTATTAGACATGGGSSTCC. _J 

CTTCAGGTCTATTACTGCTCACTGTGGCCGAGCAACAGAGCTACTTAGGT 

AAAATGGTGATGGTCATAACACTAGCCCACAGGGAGGTTACGAACCTCTG 

G7GACAA7G7AAG7GAAAGGCCCC7GAGAAAGAG7GAGGGAG77GCAAA7 

GTCAGTAGCCATCAAGATCTTCTTTAAGAATAGTTTCCACTAAAGAGATG 

A77GC777GG777CCAGCC77C777G7777G7C7CCCCGC7GGGCCTTC7 

ACCTTTAAAGGGCTTTGGCTCTGGGGGAATTGAG7TGGCTGGGGCTTGAT 

GAC77CCAAGAGGACACAAG7GGAGA7C7AC7GCC7GC7C77GGC7AAC7 

ACC77C77CAAAGA7GAAGGGAAAGAAGG7GC7CAGG7CA77C7CC7GGA 

AGG7C7G7GGGCAGGGAACCAGCA7C77CC7CAGC77G7CCA7GGCCACA 

ACAAC7GACGCGGCC7GCC7GAAGCCC77GC7G7AG7GG7GG7CGGAGA7 

TCG7AGCTGGATGCCGCCATCCAGAGGGCAGAGGTCCAGGTCCTGGAAGG 

AGCACTGCGGAGAGAGCGAGGGAGGGAGCC7GGTGAGGTGGTCCTGCCAG 

GAACCA7GC777GACATCAGAGAG7AGAAAGC7CAGAGAGGAGGAAAGGG 

C77GAAAGAA7CCCGAGCT7C7AAAGA7CA7CCC7C7C7GGGCCAGGCG7 

GG7GGC7CA7GCC7G7AATCCCAGCAC777GGGAAGCCGAGG7GGA7GAA 

7CA777AGG7CAGGAC7TCAAAACCAGCC7GGCCAACA7GGCGAAACCCC 

77C7C7AC7AAAAATACAAAAA77AGC7GGG7G7GG7GGGG7GCACCTG7 

AA7CC7AGC7A77CAGGAGAC7GAGGAAGGAGAA7CGC77GAAC7CAGGA 

GG7GGAGGA7GCAG7AAGCCAAGA77G7ACCAC7GCAC7CCAGCC7GGGC 

AACAGAG7GAGAC7C7G7C7CA7AAAACAAAACAAAACAAAACAAAACAA 

AA7AAAA7AAAA7AAAA7AAAAAGA77A7CCC7C7C7GAAGC7CAAGGAG 

G77AAGGG7G7AC7CAAGGGCACACAGCAGG77AGAGGCAGAC7CAAGA7 

7AGAA7G7GGGC777C7GACACC77ACAGGC7A77C7777AGAA7AAA7C 

CCA777C7AC777G77CA7C777777G7ACA7GCCCCACC7ACACCA7AC 

A7G7A7ACC77C7C7A7ATC77777GTATCCC7AA7GC7G7CACAC7A7G 

A777GC77777CA7GCAGA7GACCA7AACA7777CCA77CACC7A7GC7C 

AC7CAGCAAG7A77CAA7T777C7ACAC7G77C77777777CC77777CA 

7AACAC7G7C7CA7AGGCA77C7GCAAA7CC7G7GAGAG7AC77777G7G 

AAA7G77ACCAC777CC7CT7A77CAGAGAAGC7CCG7A77AAGGC77CA 

CTGAGG77GCC77AAGGCA7GA7AA7GG77CAAAGGC77GAAAGACAG77 

AAAGAGACC7G7AAG7GCACAAAAGAAAG77GAGCAGGAGAGAA77TCC7 

GCC7GGAGCAGAGCCAAGC7GC7GGAAGAGGCAA7GGGGGCAAAGGCCAG 

GCAGACAAGCCAA7GGGCTCC7CCCACAGC7GCAGCCAACAAG77A7GCC 

AG7C77AAAAC77C7AAAGAAA7A7G77777AACAAGA77GAGGAC7GGA 

77A7GAGGC7AGGGGAGGC7A7CACAAAC7GGAA7AAAA7AAAGCCAGAG 

AAAAG7GGC7GCC7TCCAACC7GCACAAC7GACC7AGC7AGGC7GA7GGC 

7GGGCCACC7AGGAAGGC7AC7GAGCA7CA7A7AAAACAGAAGGGACAGC 

AGGAA7A7AACA7GGC7C777G7AAGGA7GAG7C7GAAAAA7GACCA777 

GC7GCCCAAA7GCCC77AGC7ACAAC7GAAAA7A777CAGAAC7GGAGG7 

7GCAGGA7GC7GGAA7C7CAGAGA7CA7CCAGC7CAGCCC777A77777C 

AGA7GAGG7CCAAAGCGGG7AAAA7GAC77G7CAAGG7CAAACAGCAAG7 

GAA7GG7777C777CAAG7C7CAA77CA7C77777G777A7A7CA7C7A7 

G7C77G77G77A7AAGC77CACCCCAGG7AGCAAAAAAC7A77C7AC7CA 

AAAGGGG7AGACA7A7G77AG77C7CAAGA7CATC7C77GG777CAGAG7 

77AAC7CAAG7GA77GGCA7AGGC7GAA7CCA7C7C77AAAAGGA7AA7C 

AAA777A7G77GAAGAC7TGGT7G7C77CC7AC7A7GAAA7GGGAAACA7 

7A7CAC7AC7CC7CCCC7GTCACCACCAAG7G7GGCCACCACCACCAACG 

77AG7GAG7GAC7G7GG7GA7A7GA7GACCAAG7GGCCAGG7CAGCAAG7 

GG7GCAGCC7G7G7C7CAC7GGAAGAGG77AAAG7C777C7AAAACAAAA 

7ACCA7GGCA7CAAAG7GGCCCAGAAC7CCCT7C777GAGC7TTCCCTG7 

G77AGAGCCC77CC77GGG77GGGAGTTAAACCCA7AG7C77ACC77CA7 

C7G777AGGGCCA7CAGC7TCAAAGAACAAG7CA7CC7CA77GCCAC7G7 

AA7AAAAACAGGGACA7G7C7CAA77A7G7C77C7AAACAGG777A7777 

TCC77CCC7G7G7ACAAGAC77GAC7G77CA7AAGAAAC7GCAAACAGCC 

7GCC7C7CAAAGC7GCC7GAAACACC7GGCAAG777CACAG7GA7A7GCG 

CAGAACAG7CCAGAAGGCAGA77C7AGGCC7GGCAGG7GGGCACCC7GGG 

7GC7CCC7G77GGATC77GAGGCC7AACC7C7AGCCCAGCAGAG7CAGC7 

AAAA7C7GAGC7C7CCC7C7CCC7CCAAGCCACAC777GCAAAGGGA77C 

C77G7A77G7GGGC77GGAA7C7777C7CCCCA777GCC7C7GCAGGAAG 
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CCC77GCAACAACACA .TGGATAGCCTCCAGGTCCCAAGGCTGGAGfc 
C7TG7AA7GGGAAAG7AG7CTT7AAA7CAGA777AC77GGCACCC7G777 
GCCACTGAAAGAGGCAATTTAGGGGAAAAATCTGGTCTCCAAGCACAGA7 
AACACTCTACTCTTGAAAGAGGAGACCTGCTCATGTTACTGGTCTCAGCG 
TCTCCACTGACCTGTAATAAGCCATCATTTCACTGGCGAGCTCAGGTACT 
TCTGCCATGGC7GCTTCAGACACCTGTGTAAAAAGGAGAAAATGAGTGAC 
TTCCCCATGACGGC7ACGTTCATGTGTGATTTCTCTCAGCATCGAGTGCA 
TGGCAGTCATGCAAAGAAATGATCTCTGAGTAAATGAATGAATGTG7GAA 
AGAGAAGTCCTTTGGGTCTAGAGAAAAGCATTTGCTAAACCAAACCCCAA 
CTAGCAATGTATTGGCTAGGAGAGCTGGAGCAGAGGCTTTGACACTAACC 
TTTAGGG7GTCAGCTGTTAGATAAGCAGTATCCATTCCCAGAATATTTCC 
CGAG7CA7AAGCA77A7AT7ACACC7GGCA77T77GCAAAAAGC7GAGAG 
AGGGAGGCAGAGAGGGAAGGAGAGGGAGAGACAGAGAAAGAAAGAGAGAG 
AGAGAGAGAA7A7GCATACACACAAAGAGGCAGAGAGACAGAGAGAC7CC 
C77AGCACC7AG77G7AAGGAAGA77AAAG7CA7ACTTGAGCAA7GAAGA 
TTGGC7GAAGAGAA7CCCAGAGCAGCC7G77GTGCC77G7GCC7CGAAGA 
GG777GG7A7C7GCCAG7T7C7CCC7CGC7G77777A7AGCTrrCAAAAG 
CAGAAG7AGGAGGCTGAGAAA777C7C7GT7GAATACC7GA777CACAA7 
CAAG77AAAGGAAAGGGGAAAAGAG7A77GG7GGAAGC77C77AGGGGAG 
'"GGAC7AATAAAC7GAGA7AA77C7C7GG77CA7GGAAGGGCAAGGAG7A 



G 



7CCCACG77AGAAGAA7G777AAA777AG7A7A7G7GGGACAAAG7GGAA 
GACACACAGA777A7ACA7GCACA7AC7777C77CA77CAC77C777G7A 
C7TAAG777AGGAA7C77CCCAC77ACAGA7GGA7AAA7GGG7ACAA7GA 
AGGGCCAA7AGCCC7CCCTGTCTGTA77GAGGG7G7GGG7C7C7ACC77G 
GG7GC7G77C7C7GCCTCGGGAGC7C7C7G7CAAT7GCAGGAGCC7C7GA 
GGAGAAAA77GACCTTTCTTGGCTGGGGCAGAGAACA7ACGG7A7GCAGG 
G77CAGGCTCC7GACGGAGTTGGGGCAACCC7GGAGATAAGC7CACACAA 
CCC7GCAAGACCAGGTGCTGTTACCC7AGCCAA7CTCA7GGA7GAACCAG 
A7CAA7GCCAGATGAGC7CTGCC7AAAA7GA7T7TTTGG7GAAC7C7GAA 
AAG7GGAA7A77G7TTC7GTAAGAA7A7CCA7C7GAGAC7C7A7C7C77G 
GTAA7ACCAAGAG77ATCAGTTTCTCTTTAACCGAGACACCAGCAAAG7G 
CCTGC7CCAGGGTAATGCCCAGGGGAGCCCTCCAT77G7AGAA7GAA7GA 
GAG7CCAGG77A7GAACAG7GCC7GGAG7G7AGGAACACCC7CC777GCC 
TC777GACAGG7C7GCA7CA7AACAC777777777777777GAGACAGAG 
7C7CAC7C7G7CGCCCAGGC7GGAG7GCAG7GGCACGA7C7CGGCCCCC7 
3CAAG77CGGCC7CCCGGG7TCACACCA77C7CC7GCC7CAGCC7CCCCA 
3CAGC73GGAC7ACAGGCACC7GCCGCCACGCCCGGC7AA777777G7A7 
7777AG7AGAGACAGGG7T7CACCA7G77AGCCAGGA7GG7C7CGA7C7C 
C7GACC77G7GATC7GCCCGCC7CGGCC7CCCAAAGTGT7GGGA77ACAG 
GCG7GAGCCACCG7G7CCAGCCTGTAACACT7C77A7AGCAC7GAG77GA 
AACC77GC7CC7CC7GG77CC7CCAGGAAAC7GAAA7C77777GAGCCAA 
G7C7AGCACAG7GCC7GGCATG7ACA77CAGGTGG7AGAG777GC7GC7T 
GAA7GGG7GAA7GGGAATTTGACAGCA7TTTTAT7CAAA77AGTATGTGC 
CAGG7A7CG7GCTCGCTCTGCATTA7CCAftGGGAG7GAGCC7CTGTGCAA 
G7A777GAGACACGAGGGAAA7AGG77CTAC7G7GGGAAAAAGAGCAT77 
CA7GGAC77GC7C7CCAAGCAGCC7TC7GAT777TAAT77GGCTCCCAG7 
A7C77GA7A7CAGGAG7CAGTCACAAGAAC7CCA7C77TAG7AAG7TA7A 
77T7CCACAGGAAATCTAAAAGCTG7TCAACA7G77AGTT7CC7G7GAA7 
77GA7AAGCCA7AA7CCAT7CCTAACAC7GAGCCC7CC7GAAAT77GG7G 
TCTGG7CC7GCAGA7AGCTAAAAGCCC7G7C7GGG7GGCC7AGGGACTCC 
7C7G7777GCC7CCACAGGATCCAC7TTGCAAATTAACCACTGGTTCTCC 
CGT7G7AGGA ACTG CCACCTTCC7CAGAGCC7G7C7T7C77CC77CC77C 
C77CC77CC7C7T7C77777C777C7C7C7C7C777C777C7777C7777 
C777C777C777C777CT77C777C7T7C777C777CT77C777C777C7 
7CC777C777C7C777C7C7C777C7C7C777C7C777C777C777C7C7 
C7CCC7CCC7CCC7C7C7C7C7T7C777C77777C777C7777C7C7T77 
C777C7CTC7T7C777C7CCC7CCC7C7C7C7CT M r777C777G7C7C7CC 
C7CCC77C7C7CTC7C77TC7C777C7C7C7C7C7C7C7CC7AGACAGGA 
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TCTACCTTTATCCCCCr.jGCTGGAGTGCAGTGGTACAATCATGCATTC^x- 
TGCATGATCACAGCAGCCTCAAACCCTTCCTCAGAGTCTTTATGCGGCAA 
CCAGCAGGGTCTGGAGGGTTGGTGGCTCTGTGAACTCTCCTGACAGAACA 
CAGAGA7G7C777GG7C7G77GA7G7GA77ACAAGC7GAACGAAGGAAGA 
TCAAAGCCAGTGACAGGAAGGGAGATATGCAAGGGACCCGAGCATCAGCT 
CTGAGT7AGTCCATTCTGCTTCTGGGAC7TGGGATACAGGTCAGAAACCT 
TGAGCTTCTACTTCTCCATCTTCCAATTGTAGCATCCAGGACCTCAGAAT 
CfGCCAGCTAAGAGGAGCCCTAATGATTGTCTGGTGGGATATGGTGGGAC 
CACAGAGATGAAGACATGAATAGCTATTTGAATGTGAACAGCAGACGAAG 
AAATCAAGGCTAGGAGGGTGGAAGTGACTCATCCAATAGCACAGTGTGGT 
TGAAGCAGCACTAGTATCCAGGTTGCATGAGCCCCTGATGCTTTCGCTCG 
AGGGAAATTTTGGAGCCATGGGGCAATGCCCCCTGACGTAACAGTCTCCA 
CAGTTCTGCCATGTCTCATCCTGGCCCTGTAACCTGGACCCAAATCTGCT 
ACCATCCCATCCATCTCAGGAAGTGAAACCTCTTATGTCAAATAGGTTGT 
GCAACGTATGTATCAGATCCTGTCTTCCCAAGGAGACCGCTCAGGCCACA 
GCACTTCCTTCCGATCCCCAATGAGCAGAAAATATCTCGCTATAAACATA 
GTTGGCACTAAGGGAGGGAGTGGAAGAGTGATGATGATGTAGATGGTGAT 
GTAGC C C CAAGGAAGTGGAACAAG CAGAGATGGGG AGCTGGAAATG C CAG 
GATGCTCCAGCTTTTGGGGAATTATTCAGCTCTTGAGTCACTAAAGCCTT 
TCTCAGCTGCAAGTTCCTCTTTACCCTGTCAGGTCATTCTTCCAAGACAG 
GAGAC7GACATTTATTCAAAGCAGCAAGTGCCCTGATACCATCTTGTGTC 
TAATCATGGGC7TCGCAGCCAGTTATCAAGGTTGATCTCATCTCATTGG7 
C77CAA7CA7777GAACAAGAAGACAAGCAAAA7AA7CA7GGG77AG77C 
T7A7A77A77G7G7G7ACA7GCAG7GA7G7C7G77C777G7AG7GAGC7G 
77CC77CC77G77CACCC7C77GC77AGAACAGAAC7AAGCAA7C7GCCC 
CCAACA7777CCCCAA777CCCA7C7CA77C77GGCAC7GGC77CC7AA7 
A777G7TCT7A7GAG7GA7777C77G7A7CA777CCA7GAG7CCC7C7GG 
GA7C77AAAG7A7GAAAAA7G77G7G7G7ACCCACACC7G7C777G7GGA 
7ATT7C7C7CC777CCC7TC7GC77C7GGGA77A777GGGAA7GGGCAC7 
A7GA77777A7CA7A7CGC77CCAC77CC777A7GGCA7CA7C7CCAA7G 
GGCT7C7TC7CCCTC77GGATCCAGG77C7CAGA77GGGGACA7GCAGAG 
7CCAAGGAACA77CCAT7C7CC7CCC7GG7C7AGAACAAGGAGGGC77AG 
ATA7A7GAGCAGG7GGC7GGGGC7GGCGAGC7A7G7AG7C7CCAA7GGC7 
7T7CCC7GA7G7CGGAG77GT7A7G7CAG77C7GGGAGACCAA7AAGACC 
77G7CC77CC777GGA7CCA7CAGAAAAAGCCCC7GGG7GGG7AAGA7GG 
A7GGCAGGGC7C7CC7AC7C7A7G7C7777C7CACACC7AG7GGG7A7AA 
GAGAGGGGACCACAAACAGAGGGGGC7C7GG7ACCAC77A7CCAGGG7C7 
GGAAACA7777C7G7AAAGGGCCAGA7AA7AAA7G777CAGG7ACAAC7A 
C7CAACC77GCA7CA777CAGAAAAGCAG7CAGA7AA7ACA7AAA7GAA7 
GGG7G7GGC7GGAC77G7CC7GCGG7CCCC7G7C77A7A7CA77G7A77A 
TA7CA777777C77ACA7ACAAA777AGAAGCAA7AC77AAAAAAAAAAA 
GCCG7CC777A77GAGCACC7AC7AAG7GCCAGG7ACC7777777CCC7C 
A77A7C77A77AAC7C7TCATAA7AACC777AAAG7AGA7AA7A77GAAC 
CA777GACC7A7GCAGAAAC7GAGG77GAGACAA7AAA77A777AAGACC 
GCACAAACAG7AAA7GC7GGAAC7ACGAC7CAAA7A7GGG77AAC7GAAC 
CAAAAC CAGA7C777A777C7CAC7777AA77G77ACA7A7G777 A77GC 
C7CA7C7CC7G7CCACA7GG7GCCCA7CGGCAGAC7CC777C7CA77C7C 
AG7GA77GAG7GACA77C7AAAC7ACA77GGCC7GGCAGA77CACC7C7G 
7CCCC7AAA7G777CCACA7TG7CC7777AGGA77GAGA7CC7C7C7G77 
CCC77G7C77CCC7CCT7TC77C77C7GGCGG7GACG7GC7G7G7GAA77 
TG777CT77C7CC7C7CAGGG7AGTAC7GGGAC777CCAAA7CAGGG777 
77AG7GA7C7C7C7TCCC7T77C7GAG777C77CC77A77CCCA77CAC7 
77C7CA7C7A7AAGTGGCAGC777G77GC7GGAGGA777CC777G7CC77 
T7A77C77C777AAGAC777G7CATAAC7G7CAAAAGCAA7CCC77GAAG 
G7A7C7G7CC77GGAAT7G7G7GC77A7GA7GC7GAAAAA7AC7C7C77C 
C7AAAGC7A77A7AAA7GC7 
>Contia42 

GGC7AGC7GCAAC7C77GAA7ACAAACACA77CAGACA7GCACACAC777 
C7GGC7CCCAAAAAGAAAAAAAAAAA7CAA777A7AA7AA77C7GA7CC7 
77GC77A777CCACAAAC7CCA7GAAAA77G7ACA77G7CCAAGCAACA7 
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TTCTTAATATTCTCTTx . TCTCTCATATCCATTTTCCTTAC7GC^STC * J 1 

CACCTATCTCTTCCAAACTCCCTGTTAAAATCCCTGCCCCAGCGAACTTT 

TATTCAATTTTGTGGAATGGAGGCTGCACTGATTTAAATTAAAAAAAAAA 

AAAAAATCCCTACTCCATGTCCCAGATCCCTAGTTGTTTTTTGTTTTTTG 

TTTTCCTGAGACAGGGTCTTGTGTCTTCCATGC7GGAGTGCAGTGGCATG 

ATCATGGC7CACTGCAGCCTCAACCTCCTGGGCTCAAGTAATTCTCTTGC 

CTCAGCCTCCCCAGTAGCTGGGAGTTCAGGTATGTGCTACCATGCCTAGC 

TXATTTTTTTCTTTTATTTTGTAGAGACACGGTCTTGCCAGGTTGCCCAG 

3CTGGTC7AGAACCCCTGGGCGGACGTGATCCGCCTGCCTCGGCCTCCCA 

AAGTGCTGGGATTACAGGCGTGAGCCACTGCTCCCGGCCTTGGGTGCAAA 

TTTGAGCTTTCTCACTTATTAGTGTAAGACATACAGCTAATTTCTAAATC 

TTCCAAACCTCAGATTTTTCATCCATGAAGTGAGGATTATTATAGAGCTC 

ACTAATAACATGGCTTCAAAAATATATAATGCCAAAATTGAGATCAAAAT 

AATAAATCTATATTACATGGGAGATCTTAATGTACCTCTTATATTATTGA 

TAGAC7AAGATGATCAAAAAAATAGAAAGAGAGCAGTAAGGAGAGCAAGC 

ATTTAATCAATAGGACCAATACATTTTAATCAATAGGATCCXCAGGAATA 

TATACAGAATACCAAACCTAACAACTGCAGAAAACATGCCAAACATTTAG 

GTACAGACATTGTTGGAAAATGCAATCTTGAAACGAGTGGACTGACATTC 

AGAAGATATTAATAAGAGCACTAATGATGGGGATTGCAACCATGTCTTTA 

CTGACTTCCAGAAGCTTCTTACAGTAAACATGAAATCACATAATTTCTTC 

CACTTTCCTACTGTTTCTTGTTCTGGGCTCTGTCCTGCTTACTGTCTAAT 

ATCTTGGCCCCTTAAAAGTTGCTAATCTTCCAAACCTCATTCCTGTGACT 

GGGCCGCTGGTCCTTGTTCATGGGCCTTGAAAATACTGACTGTACACTTA 

TCTGGAGCATCCAGTGCCTACCACCTGACCCAGATTCCTCATTGCGCTCC 

TCCCTCCTCCACCTATTGGAATTTGCTCATACCCGTGTGAGACCCCTCCC 

TTTCCCCCCATCTGAATTTTTATCAAGACAACGCACTGCCATACTCCCTC 

GTACCCTGCTCTGGGCATCAGAGTGAATGTTTGTT7CCATTGAGGATCTG 

CAGCTGCATCAGTTTCCCCAGCACCGTCCAACCCCTTGAGCATGGCTAGT 

CCTAAAGCAGAGAATTAGCCTTTCTATCCCTGCTGCTATACATGCTGGGA 

CAAATAATAAGAAATGACAGCATTTTATGATAATGCAGGCTGCAGGAGGC 

AGGAGGCAGGAATCAAATTCGTGCTTATCAAATAGTGCTCCAATTCTTTG 

AATATTGGACTATAGAATATGTCATGGATCTATGCTCAGGTGGGTTCCCT 

ATTACTCACTCCACTGAGGCCAGGTTGTGGGATTAGCTGTCCAAGAGGGA 

GTTTCAGTCTCACAGCATAGGGTCATTCTGAGAATTACTGGCCCACACTT 

GTGTGGAGACCTCCAGAGAACAGAATCTGGGTTGGTGCCATGTACTTCCA 

GGAGGAGAGAAGTGGCAGGATGCCCAGCCCCACAATCAGAGGGGAAGGGG 

CAGAGCCACATGTATGAAGATCCTCTCCCCAGTACGTGCCAATCACAGGG 

C77CC7AGC7777GGGCCAAGGAAACAA7G7GGGAAGCAAAAAAGGACAA 

7777C7CC7CCC777GCA7GAAGAC7GAGCAG7777ACCAGA77CCCAGG 

GAAACACCCTTCCACTCTGGGTTGAATGTGAGTGAGAGACATTCAGCTGG 

AACACTAGAAAAACTATTTCCTGAGCGACTCACCTTTAGCCCTAGAAAGT 

GTTGGATTTGTCCTTCATCTTTGCCACAGTAGAGACTGCTGATAGCATCA 

GAACTTGGGCTCTGGAATTAGACAGATATGGGTACAAATCTGAGCTCTCT 

CACTTATTAGTGTGGGATGTAGAGCAACTTTTAAAATCCTTCCAAACCTC 

AGACTTCTCATGCATGATGTGAGGATTGTAATAGGGCCCACCTAATAGGG 

GTTTTTGAGAATTAAAAAAGTTATTCAATGAACAGCATTTAGCAAGATGC 

CTGACCATTGAGAAAATAACAAATTGTTTATTATTATTGTTATTATTAAA 

CATCTTTCCTGCACCTTCTGACTGGGGGCATCGTATCATCAGAAATACTT 

AGGATGGGATGGATTCCTGCATGGGCTGAGTCAAGGGTGCAATAATGGAG 

GAGTGAAGAAGGAAGAAATGGAGGCAGAAATCCCCAGGAGCCCAGCATGG 

TACAAGGCTGAGCTAGTGCTGCAGAGCCTCCTTGGAACAGCCACAGAGCT 

TGCATCTGGCCCTGGGAGGAACCTCTTCTAGCTGGCAGGACCAGCCACAA 

CAGTGGCCAGGGGATTTCCCAGGGCGTGGGCTCCTAGGAGTTCATTTGGA 

CCAAGCCTGCCTGGAGAGGGGTTATAACAGGGATCCTTCCCTACTGGCAG 

GTGATTTACCCCTCGGTGAGAAGCTCAGGCATTTGTTTGATGGAAGGTGG 

AAGGCCCTGTGCTGGGCCAGTGACTATCAGGGATGGGCGGGTGGCTGGAA 

AATAGCAAATAAGACAATATGATAACACAGTTAACCACCACACTATGTGA 

AGCTACAATATGGGTATCTGTAATAGACAATTCCAATGTAGAGAATAATT 

CTAAGGTGTCATTCTCCCCGCCAATGCCATAAGCACACGGCCTCTGCCTG 

GG777C7CAC7G7GGAA7G7CC7CC7GG7C7CC7CA7GCCCAGAGAGTGG 
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GAAG7AC7CC7AC777,. .CACCGGC7T7CCTG7CA7C7CCC7GCAGCC~J; 

CCTCAGCCCCCTCTGCACAGGGAGGTTTCCTCCCTGCTGCTGCAGTGCTT 

TGTACTTGTTAGTGGTACCTGCACAOVGGTATTGGTGTCCTTGTCTCACC 

ACCCTACA7CACTGTAAGCTCCCCAGGAGCAGGCTTCCTGTTTGACTCAC 

CTGTGATCCTCCACCTCCCACCCTGTAGTGCCTCAAGCATTGAGGACAAT 

CACTGGCTGCCCCTTAACCCAGAAATGCTGCCGAGACAGGAGGCCATGGC 

CCAAGTTCCTGGAATGGGGTATTACTATGTCAGCACAAAGGCCTTTGCAC 

AAATGAAGGCTTTAAAAATGCAGTCCTAGTCAGGTGGAGGAGGGCTTATA 

GGATTCCCAGGAATCTGGATCATTCTCTTGAGAGCTTTCCCTTGTCTCTG 

TTAAAACTCACATCCTACGGCCCAAATAACAACAAAAAATGGATGTAAAT 

TCTTGAAATAACTTGTGGATGGGGGAACAAGGCCCACCCCCCAGATCTGC 

CAGAAGCTTCAGGTGAGGGTCCCAAATGCCAAAAAGTCTGGTATCAGAGA 

GGATGGCCAGTGACCTGGGGACACATGCCCTTTGCTGTGTCACTCAAGGA 

GCAGCAGCCTCGGCCCCGCACAGTGACCAGGACCCTGGCTTCCCACGCTG 

GGCAGGAGCTGGTGTCTGATGAAGGGAATGCCTGGCAGCACGTGCTGTCT 

GTCTCCTCGTGTCAGCTTACCTGGCTTTGCTGCGAAGAGGCCACTCGCAT 

TTCTCAATTTTTTATATTTTTTTAATITTTTAAATTTTTTA^TTTATTTT 

TATTTTTATTTATTTATTTATTTTTAATTTTTTTTTAATTTTTTAAATTA 

TGCTTTAAGTTTTAGGGTACATGTGCACATTGTGCAGGTTAGTTACATAC 

GCATACATGCGCCATGCTGGTGCGCTGCACCCACTAACTCGTCATCTAGC 

ATTAGG7ATATCTCCCAGTGCTATCCCTCCCCCCTCCCCCCACCCCACAA 

CAGTCCCCAGAATGTGATGTTCCCCTTCCTGTGTCCATGTGATCTCATTG 

AA777C777AAAGG7GGAA7CTC7CAGTGGGG7C7AATC7G77CAGAAA7 

ATCAAAAGAGTATCCTTGGGAATGACTGGAATTCCAGAGTCATCTGGTAA 

7CCTCATAAAACAACTCCTGGATGTCTCTCAGCACATCTCCCACC7TGAA 

CGCAGGAGGC7GGT7CAAA7GGAGGAGCA7CGC7C7AC7GCACTTTTTT7 

7TT7T77GGCCTAAAG7GCAAAAGGGGA7ACG777CATGTAAA7AAA7CA 

ACTGCAAA7CGC7AGT7ATGCTGAGCCCTGTCCCG7GC7G7GGACACAAA 

GGAACCAAAGGC7T77C7CCCCGCCCAACACACACA7AACACACACACAA 

AATCA7AAAAACATACATACCCCCAACACATAACAACACACAACACACAC 

ACAAAA7 A7A7 ACACACAACACACAC CAAACA7G C C CACAAA C C7G7G7C 

CAGAGA7AGATCCTAC7GGTGGGT77GTGGTC7CGC7GAC77CAAGAATG 

AAGCCG7GGACC7TCGCAGTGAGTGTTACAGC7CTTAAAGA7GGCATGGA 

7CCAAAGAGTGAGCAG7AGCAACGT7TAC7G7GAAGAGCAAAAGGACAAA 

GC77CCACAACCCAGAAGGGGACCCCAGCAGGGTTGC7GGTTGGGGTGGC 

CAGC7777AC7TCC7TTTGGCCCC7CCCA7G77C7G77TCCA7CC7A7CA 

GAG7GCCC777777CAA7CC7CCC7G7GA77GGC7AC7777AGAA7CC7G 

C7GA77GG7GCA7T7TACAGAG7GC7GAT7GG7GCG7777ACAA7CCCC7 

7G7AAGACAGAAAAG77CC7GATTGG7GTG7777ACAA7CC7C7TGTAAG 

ACAGAAAAG77CCCCAAG7CCCCAC7GGACCCAGGAAG7CCACG7GGCC7 

CACC777CAACTCCATAA7GGCA7GAAAATACA7A7GTTG7ACAAAACA7 

ACA7ACACAAAG7A7ACA7GCATCTCCCCAAA7A7ACACA7ACCACAGAA 

ACATACACACAGGAACTCAGCTACCTGTCAAAAGTCTGCATGGTGATTGC 

CTC7GCAG7GAGTAGTTAGAAAAGTGAATTTGTT7TTCAA7AAATTGGAG 

7CC77AAAAATCGTTGTAAGATAGAAAATTTTTAAAAGTATATAAAATAA 

AA7A7G7A7G7CC777GG7C7AGCA777ACACA7G7AGGAA7T7A7CC7A 

GTGGAG7AA7CAATGATATATGCAAAGATTTGGACAAGCATATTAAGCAC 

AGAA77A7G7A7GCATATGTGTGTGTA7A7A7A7A7A7ATC7CATACA7A 

7AA7AA7G7AAAAG7GAAAA7AAC7CAGA7G77CAAAA77GAGGATTAG7 

7AGAC7A7GATCTG7CCATATGTGACATACAAG7TAGCTGCCCCTTAT7C 

7C7CGAGCT7CAACCTCCTATAAACAG7GTCCC77G7ATATCAG7ATTGG 

7ACAGA7AA7CGAAC77ATTGAGGTTTTACATGGGGCAA7AAAGGCAAGA 

G7T7A7GAA7AC7CCA7AC7ACACTAGGTAGCACCCCCTAT7AAAGACAA 

ACTC77CTC7CTCA7TTCCCTTCC7TTCCGGAACCACTTGGTTGAA7CTC 

7ACAAG7CTCTATTGCAACTGCCTCAACATGGCACCCTCCCTGCATCTCC 

A7C77CCCTG7CC7GAGAGCAATGGCCTGC7GCCCCCACAC7CACA7CC7 

CA77 C A77 C CAGAAG7GAGCAC CACAGAAG7GC C7ACAG77AC C C CAAC C 

ACCT7C77AGAAGATAAG77AG7G777G7T77GACTTTTTAAAA7TTTTA 

CTTCC7C7T7TCCTTCACAA7CTCATCCCATCCCAAGAGG7TTATCAAGA 

AG77Z7C7AAAGA7A7G7G7CTCC7TA7GGAA777AACAGAAA7CAGGGA 
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. ± - « iA* ^ w i iUiCwAi v iGGGAATAACATTTTTCCAGGTCTTTAGAC, . 

A7AA7GGAA7ACC77GCAG7AA77AGA7ACAC7A77G7AGAAA£GtATTG 

ATGAAATGGAACGATGTT7GAGATATCATATTGAGTAGAAAAGGCAAGA7 

ACA77AAG7AGGAAA7G7A7C77ACAAAA7AA777G7CAGACACAC7CC7 

A7A777G7A7G77A7A7AAA7GCG7A7G7GAAGAAAGGC7AGAGGA7GAG 

ACCACAG7C77CGG7GAAG777AAGAGA7GA7GC7GCAGCA7GC7CAGAA 

AGGC77GG7A7AG777777CCAG7AA77AAGGAC7GA7C77AGG7AAA77 

GTCCA7CC7C7C7AAAG7GCACCACC7777G7C7G7AAAACAGGAAGGA7 

GG7A777ACCCCCAGGG7CA7CAAAGGA777GG77GGAGAAAAA7AAA7A 

AA7GGGC7GAGCCCAGACC7GGCACAG7GAGAGCACAG7GG77GAC7A77 

G7GC7GGCC7G77G77CC7G7G77A77GACA7GC7GC7GG7GG7GG7CCA 

GAAGC7A77ACC77AA77GG77A7G7GGA777CCCC7CA7AC7GAGCAGC 

7G7G7G7GG7G77G7AAAACA7AGCCA7ACACAG7AAC7GACAAGGGCAA 

A7G7GA7GGAAAAA7GCAAGGAAG7GCAGA7AAA7AGC7AA7GGGC7G7A 

GAAGGAAGC7AG7CC77GGAGGGC77GA7CAAGGAAGG7CC7777GCA7G 

7CACC777GAAGAAGAGGGGACA7AGAAGAGG7A7AG7GCA7CCCGGAG7 

G7ACC7GGAAGGGAACA7GAAAAGAGGACA77777C7C7GGSACA7GGGG 

AC7CCAC77GCA7GAAC7C7GGAA77GGGGCAAAGAACCA7CA7GAGAAC 

AAGGGC77CC77GAACC7CCCAGGC7CA77GGC7GA7C7AAACCC7G7G7 

CCCC7C7T7CC77CAC7C7CC7C7G7777C7A7ACC7G7A77A77GGAC7 

GGAC7GGAAGCCACC7GA7C7A7CACAAG7ACC77GAAA7G7G77GAA7A 

GG7G7GGCACAG7CC77AGCAGAG7GGCAC7ACCCCCACAGGAA777G77 

7A7ACC7T7GGCA7GGAAAA7AGCAGGAAA7GAG7GA7CAC7GA7AAC7G 

AGGA7GCTA777A77A77GGCCAAAGGAA7AC77G7G77G7A777GCA7A 

ACCAC7CACAAAC7G77GA77ACAAA7GAG7ACCAGACC7AGC7CC77CA 

AG7AAAGGA7C77GAGAAC7GAAGGCAAACAGAGC7CCAGGAG7CCAAGA 

CAGAGCCACAGACCACGAGGA7CCC7GGCCCAGG7AGG7GG7CC7CC7GC 

AC7GGC777CAAGGCCAACAGGA7GGA7GGGGAAG7AGAG7AGCA7C7GG 

CCATC7AGACCC77GC77TTTA7CCCCAC7GGAAGCACA7C7GAA777C7 

AAA7A7GA7C7C7GAGACC7GCCCAGAACACC77GC7C7CAGCCCCAG7A 

GCAGCC7GC7C7C7CCCAGGAGGGC77CCAC7AACAAG7AGGGCAT7GC7 

GGAGGGCCAGGCAGACAC7AGC77AGGAAA7CCACCAACCC7GGAAA7GC 

7AG7CCC77C7C7GAAGGC7CAGAAGAC7GAC777AGAG7C7AGAAAATA 

77GG7CC77GGGAACAGA77T7GAG7GCAAAGAGA7GGAC77CAGATGGC 

CAGA7GCAC7GC77C777AGGGAA77C7G7GAAAGC7CCC7GCA777A7C 

77AA7ACAGGCAGCAGA777CA7GAG7ACCCCCGAGGGA7GGCCCCAGG7 

CC7CCAGCC7G7GAGCA7CC77C7G7CC77CAGCAGCACCACAG7A7C77 

TA7A7G7C777GGA7ACCTACG777C7GCCAGACA7C7C77GC7C7GA7G 

77C7GGCT3CCAAA77C7C7G7CAAGCGCC7CCAA777777G7G7CC777 

GA777ACCCCAACA7GACAAAGGCAG77G7GC77CA7G7A77CAGGGA7A 

C7GCCAAACCACAAACAGG77AAAA7CAAA7AGCAGA7A7CCC7G77CC7 

AAAGACCCA7CAGC7C7ACCCACC7GC7CC7GC7CACCG7CC77A77G77 

GAG7CCTGAAGCCC77CC77G7CA77777A777777GCA7GAACAA777A 

G77CCC7T7G7C7CAC7CC7AAACC777C7CAAAGGA77GGA777G7ACA 

CAAAC7GCCTA7C7C7GCAATCT7AGAAG7GA7ATGA77C7GAACAAA7C 

AC77AAC77T7GA777TTTATTGGTAAGATGGGAA7ACCAA777TTGC7C 

CAC77C7G7CC7ATG77GGCC7GGGC7GA7G77GAAAGC7C7CGG7CAAC 

7GAGA7AGGG7G7GCAGAA777A7A7A7A7AAA7A7A7C7CC7CCAACCC 

C7CCCAA7GAAGCAAG7CACG7GAG7CAA7CC7ACCC7AAGA7A77AGGG 

A77GAGCC7CC7GGGACAT7TGG7GGC77AGG7TTTCA7GAAAAGAGG77 

GCAGAGCAAC7GC77777G77AGGCAAAGA77AGGC7AC7GCAGAGAC7C 

AGCAAAC77C7A7AGAAGG7G7CAGA7GG7AAG7A777TAGGC777GC77 

GCCAGA7GA7C7C7CAAC7AG77AACCA7GC7A77G7AGCC7CGAAGCAG 

CCAGAGACGA7C7G7AAACAAGAGCA7G7AG7G77GGCA7AAA7A7AG7A 

CCGCG 

>Concig43 

GCAA7AAG7C7A777AC7G7AAAG77AA7CAAA777ACA777CAGAACAC 
77AA7C7GCAAGAG7CC777CCAAGACCC7A7ACC7AA7777G7G777AC 
AA7777A7A777G7777CT7AAAGAAGACCACCAA7A7AAAC7A7A7CCA 
GCC77CA7GA7AAG7ACA7AAGAAAC7A7GCAAA7AAGGGGGAAAAAAAA 
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CAAAGAAAAA7ACC7AL . f7AC7AA7GG77CAC7TC7GAA7AGC^CA7^i 

TCATAATGATACAAGCACTCATTACTAG7CTAGGAAAATGAAGATATAAT 

TGCATTAGGAAGATCAAGAGGTAGGAAATGTGGATG7GTGTGGTATAGAC 

TAGGGCAGGACAAAGAACCTAAATCCTCATTTTCTAAAGATAATTGTTAA 

TACGTAAAACTCAAAATTCAAGAAGTAACAGTAAAAGCGGTCATTAAGAA 

ACAAGCACTAAACACCAGATAGGAAGCGAGAGATGGGGGAAGAGGGCAAG 

AATCTGATTATTTTTTGCAACAAATTTTGTAAAACCATTTGACTGTTTAC 

ATGTAGAACTTGGATCTTTTTTAAAAAACACAAAATAATAATACTATTAT 

TTTTTAAC7GGATTTTTGAAAAAGAAGATAAAAGTCTCATTTTAGTAATT 

AAAACTCATTCCAGGTTAGTCCACTCAAAACTTATATTCGAAAATTAAAA 

CTTTGGGAGGCTGAGGCAGGCAGATCACCTGAGGTTGGGAGTTCGAGACC 

AGCCTGACCAACACGGAGAAACCCCGTCTCTACTAAAAATACAAAATTAG 

CTGGGCGTTGTGCATGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAG 

GAGAATTGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGAGATCACA 

CCATTGCACTCCAGCCTGGGCAACAAGAGTGAAACTCCATCTCAAAAAAA 

AAAAAAAAAAAAAATTAAAACCTCTGGAAGTTGAGTTTGCAGATATTCAT 

TATGCTCATTTTTAACTTGTATGTTTGGAAAATGTCATGATGAGAATTGA 

GGTTGGGGGATGAGAAAAAAAGAAAAACATCAACCCCACAGCCCATTCAA 

TTTTCAGCCCGACCCACAGCTCCGGGGAAGGGCAGCAGGTCCATCCTTCA 

CTCTTTCTTCACCTCTTTCCCCTCCTTCTGGCTCTTCCACCTCTAAGTTG 

GAGCCCAAGAAGAGGCAC7GGGAAA7GGAAAAG7C7777G7ACGTGG7AC 

TTGCCGGGGAAGCTGCCATGAAGACCTGGCCCCACGGTGGGGAGGGAATG 

CCCAGC7GAGGCCTCGTGCCCATGCTAGGATAGACTCGTCCAGACATGTC 

AGGTGG7CTGACAGGGCAAGCAGCAGGAAGTCATGTATGAGTATGAAC7G 

A7C7G7A7GCAAGGGCGGGGAGAACACGCGGAGGAA7GGGGCG7GAGAAA 

ACAGCACAG7ACG777C777AGCAGC7G7C7C7GC7CAGCCA7GGGAG7C 

ACCAGAGAAAGAGGCT7GGAGGCG77A7777CAC7G7GAGA7G7GAG7G7 

AAAAAAG7GCCCAAGACACAG7GAG7ACCAGGGAGA7GCCC7C777CCC7 

ACCCGAA7GCAGAA7GGCCACAGGCC77AAAACACACACA7GG77CC7CA 

GAGGAGAGAGGCCTCCACAG7GGACACCCGCA77C7CCCC7GG7CAGCAG 

CAGCAGGGCGAG7GCTGGGCCATCA7GAAGCT7CACAGGCAA7GAGC7C7 

CAGCAA7AACAGGAACAG7GCC7GGGGGAC7G7AGC7GCAAGACCGA777 

7CA7G7AAGA7GGCC7C7GAGGAC7CCGAGA7ACACCAGGC7GAGAC7AG 

CTGGCAGC7CCAAG7TC7TGGTCAGAAGAGAACAGGAAC7AGGGAAA77G 

GAA77AC7G77AC7ACAATTCC777ACA7CCGCACAACCA7GAGG7CCAG 

AGAG7C7C7C77A77T7TT77T7AAAGACAGGG7C7CAC7C7G7CGCCCA 

GCC7AGAG7GCAC7GG7G7GA7CA7GG77CAG7ACAG7C77CACC7CCCA 

GGC7CAAG7GACCC7CC7GCC7CAGCC7C7CAAG7GGC7GGGACAGCAG7 

7GCA7GC7ACCAGGCC7GGC77777777777777TT77777777777777 

7CGG7AGAGAC7GGG7C7C7C7G7A77GCCCAGGC7AG7C7CGAAC7CC7 

GGGC7CAAG7GA7CCTC7GGCC7CAGCC7CCCAAAG7G77GGAA77ACAG 

GCA7GAGACAC7GCACCCAGCCAG7A7AG7C7777AACAGC777A77GAG 

G7ACGGC7AACA77GAAAAAAC7ACACAAA7G7AAAG7A7G CAA777GA7 

AAT777GACAAA7G7ACACACCAG7GAAAC7A7CAC7ACAG7CAAAA7AA 

7GAACA7A7CCA7CACTCCCAAT77CC7CACGCCCCTTGG7AACCCC7C7 

C7CCCAAC7CCC7GCCCCCTAACA7CAGACAAC7AC7GA7GCA77C7G7C 

7CCA7AGGC7CA777ACA7777C7AGAA7777ACA7AAA7AAAA7GACAG 

AG7A7A7AC7CC77CA7GTATGGC77C777CAGCCCAAT7A7G7CAAGA7 

7CA7GC77A7GGC7GTGCG7A7CC77AGCCCA7C7C777G7C77GC7GAG 

7AGGA7ACCA77GCA7AGACAGACCACAGC77GC7CA7CCA77CAC7C77 

GACAACG77GAA77GTC7C7G77TT77GCAA7GACAAA7AAGG77GC7A7 

G7ACA77CC7G7A7AGACA777G7AAAAGCACAGCA777CA777C7C77G 

GG7AAAGACC7AAAAG7GGAAAGGC7GAG7CA7A7GG7AAA7A7A7A7G7 

C7AAC77777AAGAAAC7G7CAAAC7G77ACCCAAAGGGA77G7ACAA77 

77ACA7CCCCACCAGCAG7G7A7GAAAA77CCCG7AC77CCACA7CC7CA 

CCAA7A7A7GG7G7GG7CAA7C77777AA7777GGACA7GN7AA7GAG7G 

CAAAA7GAGGCCCAGAG7G7C7GAAG77ACA777G7A7CC77777GGCA7 

CCAAAACAGG7G7CAAGCA7AGAAAAAACAC77G77CC77GAA7GG7CAG 

TCA777ACAAG7GGAA77CA77ACAAACCGG7AG77C7AC7GGG77AAAC 

7A7GCG77AC7G7CAACAGGCACA7ACACA7ACAGACAGACAGGAAGGCA 
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CAGAGACAAGGCAGAGC . . 7GA7AAGAAGG7GACC7GGGC7C7AGC7Ca w ■ 

GCCTATCACCTAGTAAAATATTAGTTAAGTAGCCATGAGTAACTCACTTA 

ACTTACCACAGGCTCCATTTTCTTATCTGTAAAATAGGAACATTGAAACA 

GCTAA7CCCCAAGGTTTQTGGATAATCAGAATTACAAAGATCAATGACAT 

TTCTATGAGAGAAACATATTTCCAAGTATTTGATGGAGTACATCAGAGAC 

AAAGGAAAGGAAACTGAATA7T7 T 7GAGG7 7 7 7 TTT 7 T7TTACCAAGAAA 

77CACA7777G77AAA77T7CAGAAC7ACC7CC7GAGGAAAG7GTAGC7G 

CACC CATTTAGAATGATAGAAAACATCAATCTGTCTGATTCCAAAGCCAA 

GTTCTTGCTACAACGAGAAATGAAACAACTGGATCCCTACAGATGCAGAG 

ACCTGGGCCCCACAAATGTGAATTCTGTTCCCCTACCGAATAGAGTTACA 

GTTCCATAATACAGTACTCCCTCACTTTTCCACAGTCTG\CATTCCACAG 

TTTCAG7TACCCACAGTCAACTGCAATCCAAAAATATTAATGAAAAATTC 

CAAAAATAAACAATTCAGAAGTTTTAAATTGTGCTCCATTCTGAGTAGCG 

TGATAAAATCTTGTGCCACCATCCCACCTGTCCAGCTTATCGTTAGTCAT 

TGACATCGTCTGCTCCTGACATCCAACCATTGACATCATCATGACTCTAT 

GATCCAGGATCACCGAAGCAGATGACCCTCCTTCTGACATATCATCAGGC 

CAATATCAGCCTAAACACTGCATCACTATGCCCACATCAGTCACCTCACT 

TCATCTCATCAAGGAGGCAATGGATCACCTCACATCATCACAAGAAGAAG 

AGTGGGTATAGAACAATAAGATAATTTTGGGGCAGGCATGGTGGCTCACG 

CTTG TAAT C C CAATACTTTGGGAGGCCAAGGCAGGAGG ATC C CTTGGGC C 

CAGGCATTCAAAACCAGCCTGGGAAACATAGTGAGACCTCCTCTCTCTGC 

AAAAAAAAATAAACAAAATTATCCAGATACAGTGGTGCATGCCTGTGGTC 

CCAGCTACTCAGGAGGCTAAAGTGGGAGGATCACTTGGTCCCAGGAGGTC 

GAGGCAGCAGTAAGCTGTGATCGTGCCACTGCACTCCAGCCTGGGCAATA 

AAGTGAGACCCTGTCTCAAAAAAAAAAGGTAATTTTGAGAAAGAGACCAC 

ATTCATACAACTTTTATTATAGTATATTGTTAGAATTGTTCTATTTCATT 

ACTTATTGTTGTTAATTTCTTTCTTTGCCTAATTTTTTTTTTTTTTTTTG 

AGTCGGAGTTTCACTCTTGTTGCCCAGGCTGTAGTGCAATGAGACGATCT 

CAGCTCACCGCAAATCCCGCCTCCCGGGTTCAAGTGATTCTCCTGCCTCA 

GCCTCCCGAGTAGCTGGGATTACAGGCGCCTGCCACCATGCCCAGCTAAT 

TTTGTATTTTTAGTAGAGGCGGGGTTTCTCCATGTTGGTCAGGCTGGTCT 

CGAACTCCTGACCTCAGGTGAGGCCTCAGCCTCCTAAAGTGCTGGGATTA 

CAGGCTTGAGCCACTGCGCCTGGCCTCTTTGCCTAATTTATAAATTAAAC 

ATTGTCACAGGCATGTATTAATTTATAGGAAAATCATAGACATATAGAGT 

TGGGTACTATCCACAGTTTCAGGCATTCACTGAGGGGCTTGGAACACGCC 

CTCCTCAGATGAGGGGGGACTACTGTCATCTCCTCAATCATTCTTGATTC 

AATCCTCAACACAAATGGTTTGGCCAGGTCTTGCCTCTGGAGACAAAATT 

GCTAAGGATTTAGAGGGGAAAAAATG7AGTTCACTGGGAAAGTCACCTCT 

GC7CGAC7GGACAGCAACTTAAAACCCAGGCCATGACAAGTAGAAAGGCC 

ACCCCCAC7C7CC7TCACACC7GGAG7A77CAGGAG7CAA7CA7A777CA 

GGACCACCAGGAGCAAACTGGGAAAAACTGAGCTGCCTTGAGGAAAGCAA 

7CAGC7CCACAAGGGGCTTAAGAAACAAGCTC7GGGAGGAGTGG77GGAG 

AAGAG77GGGGACACA7CAGAAA7GCCA7CAAAT77C7AAGGGC7ACC7C 

GTGG7G7CAGACCTGTGCATCTTCAAGGACATAAACAGATGGGATAAGCA 

GA7GAGA77CACAGAGGACA7CAAAA7A77GGC7CCCCAGAAGGGAGAAC 

ATTC7AGTAACAGAGCTGCCCAGCTGCAGAGTGGACTGTTTCACAAAGCA 

ACAGG7GCCCTGCCTCTTGAATCACCATCTTCACAGGAATGCAGTAGAAG 

GGAC77AAC7CCTGCCCTGAAGAAAAGGTTAGGCTAGGGAAACAGCTCCA 

AAA777777AAAAGGAAGCAACA7AGGCA7C7AC7GGGAG7777C7AAAG 

CC777G777AA7GAAAC7AAAGAGC7GGGACAGGAAA7GCCAAA77AAA7 

7AA7AGAGCCTTGCTTTAAGACAATGCAAGTGGA7GGTAATGAAGGAATG 

AG7C77AGGCCTTGGATCAACCGTAT7AAGCAATGCTGAGCA7GGAGCCA 

AT7C7G77CACTAGATTTGCTCAGAAAGGGCCAGACGAGAAGGA777TTC 

7AAAGGCACCTACTACCAAAAAGCTGCCAAGGCGTCCAATGGAGCCCAGA 

GAGAA7A7GCTAACAATAAAAAGTTGAACACCCTCAATAAAAAAGGGTAA 

AAG7AA77AA7AGAAAA77AC7GAAAG C777777 GAAACCAAAAG7AG7C 

AGCA77GG7AAAAGTCTACAAAAGTGGACAC77TCATATAATG77GGCAG 

GAGGG7AAAAAGACATAACCTTTT7GGAGGACAA777GGCAACAGAG7AC 

CAAAAACC7TACAAT7GAAGAGAAC777GGCC7GAG7GCAGTGGC7CACA 

CC7G7AA7GCCAACACTT7GGAAGGCGAAGG7GGGAGGA1TGCT7GAGCC 
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CAAAAG7TTGAGACCAGLCTGGGGTAACACAGTAAGACCTCGTCfCTATu ; 

AAAAA7AAGAAAAG77AGC7GGGCA7GG7GGCA7G7GCC7G7GG7CCCAA 

C7AC77GAGAGAC7GAGGCAGGAGGA7CGC7TGAGCC7CGGAGG7CAAGG 

CTGCTGTGAGCCATGT7CATGCGACTGTTCTCCAGTCTGGGTGACAGAAT 

GAGACCC7G7C7CACCAGAAAAACAAGGCAAGAGAGAGAGAGAGAGAGAA 

GGAGAGAAAGAAAAGAAAGAAAGAAAGAAAGAAAGA7GGAAGGAAGGAAA 

GAGAAGAAAGAAAGAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGA 

AAGAAAGAAAGAAAGAGAAAGAAAGAGAAAGAAAGGGAGAGAAAGAAGGA 

AGGAAGGAAAGAAAGAAAGCAAGCAAGCAGGAAAGGAAGGAAGGAAGGAA 

GGAAGGAAGGAAGAAAGAAAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAA 

GAAAGAAAGAAAGAAAGAGAAAGAAAGAAAAAGGGAGAGGGAAAGGGAAA 

AGAAAAGGACAAAGAAAAGACCTTTGAACCCTGAATTTC^CTTTTAGAGA 

TTCATCTTAAGGAAATTCATTCCAATAGAAATTTATCCC CAGGATTATCT 

AAA7A777GC77T7A7T7TC7TC7AG7AA7777A7GG777AAC777C7CA 

TGTTTAAGCCTTTAATTTATTTGGAATTTATTTTGGTATGAGAAAGTGTG 

ACCTTTTTTTGTTTTACTTTAAAAAAAATGTATTACGATTATTATTTTAG 

AGACAGGGTCTTGCTCTGTCACCCAGGCTAGAGTGCAGTGGTGTGATCAT 

AGCTCACTGCAGCCTTGAACTCCTGGCCTCAAGCAATTCTCCCTCTTCAA 

CTTAGGAGTAGCTGGGACCACAGGCATGTACCACCATGCCCAACTAATTT 

TTTTTATTTTTTGTAGAGACAGAGTCTTGCTTGTTGCCCAGTCTTGCAAT 

GTTG7CTCAAACTCCTGGGCTCAAGTGATCCTGTCGCCCCAGCCXCCCAA 

AGCACTGGGATTACACGTGTGAGCCACTGCGCCCAGCTGCCTTTTTATTT 

777AA77777CAGA7GC777G77GG77CCAAAA7AGCAC77A77AACCCA 

CGCT7TCCCCCTCTGGTTTTAAATACTGCAAGT7TGGCTTTGAAATACAA 

CCCAC7GCC77A77CAGGC7ACA77CAAGGAAA7C7GAGACCAAGAG7C7 

GAAGGCCCAG777CCT7CCTCAAACCCAGGAGG7GG7AAA7G7G7CAC77 

CCACAC777C7A7C7A77TCTAAGAAC7CC77C777CCAAACTC7GACA7 

GCCCC7GGC7CAGG7C7ATAGAAA77CCCAGGG7CCACAGACAAAGCAGA 

ACTCAC77A7GGGGAAA7C7GGGAAA7AC77A7C7G77AAACC7GCCCCA 

TATGG7GAC7CAGA77G7C7AAAGCCCAAAGCA7CA7777CCACCCCAAA 

CCA777CC7CC7CCAGAC77C7C7A77TC7G7GG7CCAGAG7CAAGATC7 

7GA7A77ACCC7AGAG7CCCCC77C7GC7C7CC7GCA7ACCCAGA7GCCC 

CTCCC7CCCCAGA7CCA7TCTCCCACCC7CCC7CCCA7CAG777GG7GGG 

CCCA7CACCGC77CCCC7GGCCCAGGC7C7CC7777G7GCGC77GGAGCA 

GCAGAC7GA7C7CCCAGCC7TCAC7CAC77CA7G7GG7AA7C7G77G7G7 

7CA7CAC7G7CAGAA7C7TCTGCA7CCCC7CAC7AC7C7GC7GAAAACAC 

7C7AG7GG77CC7CA77GC7CA77AA7GAAAG7C7AGA7A77AAACG7AG 

AAGGCCCAGCACAA777GCCCC7A7GCCACC7ACC7C7C7AA7C7777C7 

CC77AC7C7GACAGAC7C7CCG7C7G7CA777A7G7A77C7777A77GC7 

C7C77C7AC7777AG7A7GAAC7GGA777A7GGA7777777AACA77GC7 

T7CAAG7A7GGAA7AAAGAA7777A777A777A777A777A777A777GA 

GAC7GGG7C7CAC7C7G7TGCCCAGGCCAGAA7GCAA7GG7GCAG7CA7A 

7CTCAC7G7AACC7CGAATTCCTAGGC7CAAGCCATCC7CC7GCC7CAGC 

C7CC7AAG7AGC7ATGACTACGGGTGTGCA7CACCACA7C7GGCTAATGG 

AA7AAAA7A77ACAA7GCC7AA7C77AA7777CAAAA7777AAA77ACA7 

7GTACC7AA7GCCCATGCA7T7ACT7TTTTCAGTGGG7CAA7AGCCCTCA 

C7T7GGCAAAGG7CCCAGGCCCAAGG7AAGGCC77AC777 77 CCAAAC7C 

A7C7777GAAAGACA7AAG7GCCTG7AAGT7G7ACCACA77AGG7TCTAG 

GAA77777CA7CAAAGACT7TATCAGAC7A7777CC7C7AAG77GAGAAA 

GAGC7GGGGGCAGAATATGGCAC7GAA7GAC7GAAGAGAAGGCAC7GAAA 

7CAGGCCAGAGG77GC7GGAAAGAGCAA7GAGGAACACCAGCAGCAA7GA 

GGAGCCGG7GA7GA7777GGC7TCACAGGGAGG7G7G7ACCACACCGA77 

77A7C7C7ACG7GGA7GAACCACAGC7GTCGGC7CCC77G7CTCCAGGAC 

A7CACAC7C7CCACA77CCCTCCCA7CTTCCGGC77C7GC77CCCGGGGC 

CCTCA7C7GCCCCA7CC7GGG7GAACAC7GG7CGG7CAAC7GC7GGGCG7 

ACC77CCCGC7CTGCACACCCTCCC7GGCCACCCCACCCAC7CTCACGGC 

7CGCAC7GCAGAGGAGCCGCA7C7C7AGC7CCAGCCCA7C7GCC7C7TC7 

GAGC7C7AAC77CA7G7AGGCGAC7CC7GCCGG7G77GCC7CACAGGCCC 

A7CA7AC77CAAAGCA77TTCCCC7CAGAACACCA7G7CC7GGC7GC7CC 

C7CCAGAAGA7ACA7C7C7CAAGCACA7CCCCGCGGC7C7CACC7GGA7G 
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ACTGCATTCACCTTCTC -ACATTTGCCCTCCTTTGGATGTATATAGA . 

GTTTTAAAATACAAATCTGATGTGCTTGCTCTCCTGCTTGAAACACCTCA 

AAACTGCCTTCAGGATAAACCACTGCCCTTGACATGTTCACAGGTTGCCC 

ATGGCCTGGCCCTGCCCATCTCTTCAGCCTCATCTCATGCCCCTTGCCCC 

TCGCTCTCTGGGCTTCTGCCTCCCTAGCCCTCCTTTAGGTTCTCTAACAC 

ACCATAGTCCTTCTAGTGTTGGGGCCTCTGCAAGTGCTGTTCCCATTGCC 

TGAGACATGAATCCCTCTCCCTATCTCTACCTGCACCTTCATCTGATTAA 

TCXZCTACCCTTCCTACTCATGATGTTGCTTTCTCAGGGACTCTCTCTGAC 

TTTTTAAACTAATCAGGGTCTCCCCAGTATATATCTTCATAGCACTCTGT 

ATTAC7CCTTTCTTAATGACCACCTGCTGTAGACTGAATGTTTGTCTTCC 

TCCAAAATTCATATGTTAAAACCTAGCCCCAAATGTGATAATATTTGGAG 

GAAGGCTCTTTGGGAGGCAGAGCGCTCATGAATGGGATTAGTAGCCTTAT 

AAAAGAGACCCCTGAGGGCTCCCTTGTCCCCTCCACCGTGTAAGGATGCA 

ACAAGAAAGTATGGTCTATGATCCAAAAAGCAGACCCTTGCCAGGTACCC 

AATATGCTGGCACTTGAACTTCCCAGCCTCCAGAACTGTGAGAAATAAAT 

TTCTATTTTTCATAAGCCACCGAGTCTATGGTATTTTGTTATAGGAGCAC 

AAACAGACTGATGTGCCACCCAACCATGATTATACGTGTAATTTATGGTT 

TCTCTGCTAGTAGGGATGCACCATGGGGTTAGGAACCACGCTTTTCTTAT 

TTCCCACACAGTCCTTAGCTCTAAGCATGTTCCTGAATCAAAGATCCCCA 

TCTTTTATGAATGAAGGAGTCAGTGAATGAATTAATGAAAGAACTGATAA 

CCCTCAATAATTATTCCAGCCTTTTATACCTACTATTAACAAGCTTGCAT 

TCTACTCCAAATTTATTGGGCTTTAACTCTATTTTTGGCCAGCCAtATTT 

GACATTCCCTGAAGTAAATCTATGCTTTCCATCCTAAGTCAAGGAAGGAC 

CTGGACTAGTAGGGCCAAGAAAGGTCTAAATTCCATGGGTGGGAGAGAGA 

GACTAAATCTGAAAGGAAGAATAGATTGAGCAAAGGTGTAGAGATTGGGG 

AAGGCTGGACATTTGGAGAGAAGGAAAAGGAAACTGACACTAAACCAAAC 

AGTCTCACAAACACAATCTCATCCTTCCAAAACTCTGTGAAGTAAGAATT 

ACTATCCCAGGGCCAGGCACAGTGGCCCATGCCTGTAATCCCAGCACTTT 

GGGAGGCCAAGGTGGGTGGATCACCTGAAGTCAGGAGTTCAAGACCAACC 

TGATCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAATTAGCTGG 

GCATGGTGGTGCACACCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGG 

AGAATCATTTGAACCTGGGAGGTGGAGGTTGCAGTGAGCAGAGATCGTGC 

CACTGCACTCCAGCCTGGGTGACAGGGAGACTCCGTCTCAAAAAAAAAAA 

AACAAAAAAAAAACCAAAAAAAAAACAAAAAACAAGAATTACTATCCCAG 

TTTTGCAGATGAGGCAATGGAAGCTCTAAAAAGTTAAGTAGGAGAAACAA 

ACATGAAATGTATGTCTTATGCTTTTCCTCATCCTATTTCCTCAGCCTGG 

AATGTCCATTCTCCCTCCACTATGCAAATCTAACTCTTCAAGCTAACACA 

TAGCAATG7CTGAGAAACCGTCCCTGTGTTCACTCTGTTAGCCTCACTTG 

CTCCCTCCCCATCCCTCTGTTTCCTTTCTGTTATAACACTTCTCTATTCT 

GCTGGCATCACAGTCATCTCCACCTGCCTTCCTCACAAGTTAAAAGCTTG 

TTAAGGGCAAGTGGTGTTCTTTGCCACCTCATTCCCCAGGGCTTCTAACA 

CAGTGCCTCATGCATGACAGAGTTGTAAAACAGGTTACCAAGCTGGCTTC 

AGGCAGGTTTGCATGGAACTGTGCTTTACAGGAATACCTGCTCCCCCCAG 

GCCCTGGGTCTTCCTCCTGAGTCCAGGCTCAGACTCTCTCATCCTGCTCG 

TTCTCTCTTGGGGAGCCACAGTAACTTTGAGCAACTTTGCATGGGATAGA 

ATGGCCTATTAGGGGCAGCACAAAGACCCCATGGAGGGAAGAGTACAGAA 

AGGGAAAACGATAATOVTATTTTTTTAAGATGTGCATTTTCTTAACAAAA 

TGCTCTAGTACTTGTCCAGACTTTCAAACTCAAAAACCTAAGCGTCCTTT 

TCTTGAAGATCATCAAAGGCCCCAGTGGTCCTTCAGGTATGTCAAGCTTT 

CTAGAAAATAAAGGTAAGTCATAATCACTTAACACACATGGCTAAATGGC 

CATTTCCTTCTAATTTATCAGCAACTGTTACATATTTCTATACTAGAAAA 

AATTTATATTTATACTCAGGGTGGTAAGTTAAATTTGCCATCGAAGTAAA 

GCAGAAAGAGCGTAGCATGTATGTATATGTAACTCAACTGTGCATGAGAC 

AAAGATGTCTTGAGGAGAATGAGTCTAAGATGCGCCTGAGCAATAGTACC 
C 
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>Contigl 

GCACCCATGTTTCTAAAGGGCATACCAGCCATAATAACAGGATGGGTGAG 

GATATAGACAGCAGATGACAGAGAGGAGAGTGAAAGCTGGGAATCCCAGC 

TAAAGGCATCAGGTTTAT5GAATGAGTAGGGGACAATACTGTGTGTGTTT 

ATACACACATGTATATGTGTGTATATGTATACATGTTTATGTATATATAT 

AATTATATGGTACCATTTCTAATTGACAAAATAATCTATCACATTTTACA 

TT^TCAGATTTTACATCTATTGTTCTAAATACACTCAGTCATCAGCCCTG 

TGTGTGGGCTCTTACCCATCCCCATGCACACCTCAGCTCAACCACTGATG 

GATGGATCATCTGCCTATCAGAGGTGGCATATTCAGGTGAATCCATGGCC 

ACAGCTGCAGCACTTCCTACCCACGCAGAAAGGCTCCACAAGAGGAGGCA 

CACCCGCTCTGACTGTCCCTAAGCTCCTGACATCTTCACCCCATGAAACT 

GCTGCTCCTGGGTGCTTCCTGCCTTGCCCTGCCCACCCTTGTACTGTTC T 

CACCATTGACACAGCTGGTGCCCGATGCAC 

>Concig2 

NAAAACGAATCGTCACTATTGAAGCCTGTCTCTCANCGGATCGTGACTAA 

GAACCCCCTCCTTGCTTCAAGTTGTCCTGCCTTTCTAGGCAGAGCCACCC 

TACATCTTAAATATATTGATTGATGACTTACGTCTCCCTAAAATATATAA 

AACCAAGCTGTGCTCTTACCAACTTGGGCACATGTGGTCAAGACCTCCTG 

ATGCTCTTGTCATGAGTGGGTGGGTGTTCTCAACCTTGGAAAAATAAACT 

TTCTAAATTAACTGAGACCTGGGTCAGATTTTTGGGGTTCACAGCAACAA 

TTTAAAAAACTCACCATTGACCTGAAATTTTGACCTTATGCTGTTGCTCA 

CACTCCTCCATGAAAATAGACGCCATCCTATGAGTTCCCTCAGCCATGTC 

ATGCCACACTTCCAACATGTGTCCCCATCCACCATCTGTCTTCTTATTGC 

TGCATCCTACCCAGGCCCTGATCTCTGGACCCATTGTTGTATAATTAAGA 

ATTTGGGGCTGGGCATCGTGGCTGTGGCTCACTCCTGTGATCTCAACATT 

TTGGGAAGGTGTATTAGTCAGGATTCCTCCGAAGGATGCAACCCTAGGGA 

TCCTCTCTATGACCCTATGTCTA 

>Contig3 

CGCGCTCAACCGACCGATTTGCGCGAACCTGCCCATGCCCGAGGACAGTG 

TAATCCTAAAACGTCCCCTGAATCATAAGGATATGAGTGCGAAAGTACGG 

TTCCCTCTGTCACCACTTTCTAACAACGCTATGTCCGATCCGTGCACTAA 

CCCCGCCCAAGTCACTGAAACACTGATGGGCGCTTCCTCTACAGGTATCC 

AGGGCCAATACCACTACTCCCCTCCTCCCTGTCCCCCTTCCACTCTCTAG 

AGGCCGCGGATGCCATCCTCTATTAGCACAACCGAAAACGACGGTGAAAG 

TACCACGAAGCTCACGATCTGATCGGTCGCCCAATGCGGTTACAACGGCT 

GTCATCCCAACCCCCGTCCCATCCTCCATATTGCCCCCCCCTATGAGGAT 

GGCCCTATCATCATGACCTCCAAAATTCTGTCATCTCCCGACGTAATGCC 

GCCCCTCGAACGCCTGACACCATCAAGTCNGTCACCTCCCAAAATACTCC 

TCCTAATCACCAGGCCGAGTATCCCCGGTTCCACAATACCTCCTTGAGAC 
GGGC CGATATCACACAC 

>Concig4 

NGGAGTTTAGGTCAACTAGTAACAAGTGGGATTTGCGACTCAGGTCTATC 

TAATCCTCAAACCCACX3TCCTGGACCCCTACACAGACTGCCCTCCCTCAG 

TCCTCTGTGTGGCCTCAAGAAGGGTCTGGACATTCAAGTTTAAAAATCCA 

TCCAAAGAATCTATGGACCCAGTGGTCTCTGGAGTCAATGTTCTGAGGCT 

CAGAAGGGCCAGGCAGGAGGGAGCCGCCTCTACACAGTCCTGAGCAGAGT 

GGGCTGTGTCCCGGCACAGCAGGGGAGATeATAAACAGAATTCTGCCCTG 

GGCCCTATTTAAGTAGGACCTTTAGGCTGCCGGTGTCATGACCACAGGTC 

CCANGTCTGCACGATTGGCTGTGTGTGGAAAATCTTCACTCCTTGCGGCC 

TTGTCCTTGGCAGAGAGCACCGCTGCTTCCTCTGATGGCCACCAGGGGGA 

GGCGCTCCCCTGGGAACGGTTTGAANGGGAGCCTCACCCCACACGTGCCT 

TCCGTGGTACCCAGCACCAGCTGCTACCCATGGTTACCCACAGGCCCAGC 

TCTGCTCTGAAGAAGGAGGAGTGGTGGCGATCANGCCTTGTCTGCATCCC 
GTGGCTGCCCCTTTCTTTTCTTT 

>Contig5 

GGGAGCTAACCGCTCACTGGGATTACAGGTACGCACCACCACGCCTGGCT 
AATTTTGTATTTTTAGTAGAGACGGGGTTTCTCCGTGTTGGTAAGGCTGG 
TCTCGAACTCCCAACCTCAGTTGATCTGCCCGCCTCAGCCTCCCAAAGTG 
CTGGGATAACAGGTGTGAGCTACCATGCCTGGGCTTATATGTTTCTAGTC 
CAAACATTTAGCTACCTTTTl^TTTTTTTTGAGACGAAGTCTCACTCTGT 
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TGCCCAAGCTGGAGCACAGTGGCACAATCGTGGCTCGCTGCAGCCTCAAC 

CTCCTCAGGCTCAGGTGATTCTCCCACCTCGGCCTCCCTAGTAGCTGGGA 

CTACAGGTACGCACCACTACACCCTGCTAATTTTTTTGTTTTTGTATTTT 

TTGTACAGATGGGGTTTCTTCATGTTACCCANGCTGGTCTTGAACTCCTG 

GGCTCAAGCAATCTGCCTACTTCAGCCTCCCAAAGTGCTAGGATTACAAG 

CATAAGCCACCATACCCGGCCTACCTACTTTTAACTTGTGGAATTTTCTA 

TAAGGTCANGGATGCCTGNGGGAACAAAAGTTTCTCCCTTGGTATATGCA 

AGTAAAATCCACATGCTGCCTCCC 

>Contig6 

AGGACTGTAGCTGTTGTCTAGTCACCAGGCTGGACTGCTTGGCATGATCT 

CAGCTCACTACAACCTCCACCTCCTGGGTTCAAGGGATTCTCCTGCTTCA 

GCCTTCCAAGTAGCTGGGATTACAGGCATGCACTACCATGCCCGGCTAAT 

TTTGTATTCTTAGTAGAGACGGGGTTTCGCCATGTTGGCCAGGCTGCTCT 

CAAACTCCTGCCCTCAAGTGATCTGCCTGCCTCGGCCTCCCAAAGTGCTG 

GGATTACAGGCGTGAGCCCCCGGCCCACATGTAAAAGTTTATATCTCTGT 

TGTTTCACCTTGTTTTTGACCTAGTCTTTCAGTGATTTGAATCTTGATTC 

AGTCTTTTGTTATTTTAGTGGTACTTCCCAGCTTTGTGTCATCTGTGGAT 

GACATATGAGTCTTGCTTCTTCATGCCAATTTAAGAAGACTGAACGGGAA 

7AGGTCAAAGGCATGGCCATGAGCGATTTCTCTCCAGCTTTTCATGGTGT 

TCAGCTTCAAATCTATTCACATATTGGACCTGCAAGCCATCATCTTATCC 

ACAGGCTATCATCATAGGTGAATGTAAATTGGGTTTAGGTGGCCXAGCTG 

AACGTGAGATATNTTC 

>Concig7 

AGCATGTTCTCTAAAGGCC7ATCAAAGCTGACATCAAAGGGATAAGTTCC 

AGTTACCCAGCTGAAGGGAAGGAGGGTGTTTCAGATAGAGGAAGGATAAG 

CATGACCTATTCAAGGCCAGTGAAAGAAGCGTGCAACGGCCAAGTCAGGA 

GAACCTGAAATTGTGTCAAAGAGCTTGGATGCAAAGAGCCGTGGGAGACT 

ATTGGGGGTTTTAAGCAGGGATATAATATTCATTCAAGCATGCAGTAAAA 

GGTCACTGGCACCTGCCATGGGCCAGGACTCGGGCTCTACATGATTGCGT 

CTGTTTTGGAAATATCACCCTGGCTGTGAGATGAAGAACAGGTAGGAGGG 

TCACAAAACTTGAAGCAGAGAGACTGTTGAGGAAGTAAGCTGTTTTTGTG 

TGGACTGTGGCAATCACAGAGGCAGAGGATATAAATGCACAGAGACACAA 

GGCATGTGGGAGGCAGAAGGAATCAAATACAATGAGTGATCAGATGTGGG 

GTTAGAATGGTGAGTGANAAAGACATACTCAAGGTGACACGCCCAGGTAT 

CTGGGTGGATGGTAAGACATTCATGGACTAGAATCGAAGAGGAGGTGGGG 

ATGGACATTCCTTCCGTTTAGAGGGGTTCACCAGGAGGATTTGCCGGAAC 

ATGGAGAGGATTAACCAGGAATCCGGTGCCTTTTTCCAAACTGGGTTGGA 

GGGG 

>Contig8 

GGTGAATGCTTTGGCACGCTGTGTAGATTTTAGGTGACGGGTGGTGACAA 

TGAGTCCGTGTCGAGCGCTGATTTTTTCGGCCTTTAGAGCGAGATTTATA 

CAATAGAATTTGGCATGAGATTGGATTGCTTTTAGTCAGCCTCTTATAGC 

CTAAAGTCTTTGAGTGACTAGATGACATATCATGTAAGTTGCTGATAGGT 

TTCCAGTTTTCCGCTCCTAGGTCTGCATATTGTACTTTTCCTCTTACTCG 

ACTTAACCAGTACCAACCCAGCTTCTCAACGGATTTATACCATGGCACTT 

TAAAGCCAGCATCACTGACAATGAGCGGTGTGGTGTTACTCGGTAGAATG 

CTCGCAAGGTCGGCTAAAATTGGTCZATGAGCTTTCTTTGAACATTGCTCT 

GAAAACGGGAACGCTTTCTCATAAAGAGTAACAGAACGACCGTGTAGTGC 

GAATGAAGCTCGCCATACCATAAGTCGTTTTTGCTCCCGAATATCAGACC 

AGTCAACAAGTGTCAATGGGCTCGTATTGCCCGAACAGATTAAGCTAGCA 

TGCCAACGGGATAAACGAGTCGCTCTTGGTGGAGGG 

>Contig9 

GGGGTGGGGCGCCTGGTGTTTCTAAAGAGGATCTCCTGCCAGAAATGGTG 
TGCTGACACTGTTGTCCTCCTTGGTGTGGAACTTTGGTGGGAAGAAAGGT 
TGGAAAGGGAAATTTTGATCCTTGGATTTAACCCGAGTTTGTTACTGATG 
CTCACAAGACTAGGGGAAGGATAAAGGCAGGTGAGTCACTCTAGGATGGC 
TCANTGAGCTCCACAGAGCTGGAACCACAGGCACC1AGGAGGGATTCAGAG 
CAGGCCTCAGTGCACGTCAGCTGAGTGAACCAATGAGCAGGTGATGGGTC 
CAGGCAGAGCCCTGTCCTCTTTAGGCAAAAACCCTTGAAACACCGTTCCC 
ATCCTAGCCTGTGTTCCACCCAAAGCTGGCCAGTCTCCAGGCCCTGCCTG 
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AGCCCCAAGGAAGTGGTATGGTGAAACAGAAGGGCCATTCCTGTCCAATG 

TGTGAGGAACTTCATTTCAGACTTGTTGGAAGCCCTGATGTTCAAAAACC 

TCAATGATATCATTCATTTTCCCCATCCATTCAATGCCCATCCAATGCCC 

ATCCGTTCAATGCCCCTTCCATTCCTCTTCAGGGAAATGAAAATTGTTCA 

GAAATCCTTTCTCTTTCGAGAAACCAACCAAACCAAAACCGCGAAATTCA 

CTAAACTAGCCAAGACACAATCCTGGGTTATTTTCCTTTTCCCAAACCTC 

CTeTGTTTAAATTAATTCTACCCTGGTTCTCGGCCCTTACTGCGAAGGTG 

AACTCACCTAACCTCTCCCAAACAGAGAAGAAACTTCTCTTGGTAAAATG 

GGTTTTAACACTTCTAAAAAACCCCC 

>ConciglO 

GCTATGGTTCTAAAGGTAATGGACTATGGCGTACACAACGTCTCGCTCAT 
CGTCTGCCAGGAGGCTAAGGTATCCACGGACAATCGCTGAGCAACAGTGT 
CGTTGATCCATCTCTGTACGCACTTGTCAACATGGCAGGAGTACGGGAGC 
TGCGAGAATCCTCTCTGCTGATGTCCCACGGAGCATGCCGTGAGACAACG 
CCACGAACGGCCCTCGGAGANANCTACTCTGCAATGAAGACGTACGATAC 
ACACGTAGGAGTCCTAGCTCACCAGCCGTATCTAGGTATACTGTACTCGC 
GGATACTCACTCGTGCATGCGGCAATAGATCGATACGCAGTCGTCACGCC 
CATGCTCTCAGTGTGTGACCTTCTGGCGGTAGCGTNGTGGGCGCTATTAC 
TGTGCGCAGCAGGCGCNTCGTACATGTGTCGGGTAGCGATGCCAGGAGCT 
GTAACATAGCAAGTCGCCCCCCTACTCCTATCACTATCCCTACGCTGGAC 
CGCACTCGAGATCTGAACGCACGTCTTAACCTGCCAGTACTCGTGAGACC 
TATACTGCGCAAGCCTTGGCTAGGAGATCCTGCAGCGCCGGCAAAGAATC 
AGCTATGATCCCCTTGCGATTATCGCACACGCACCATAGAGTATGTGCAT 
ATTAACCTCTGAATGTGCTGCAAGCAGACGGTTGCTCAACATATATATGG 
ATGTGGGGAAATCGCCCTGGTCACCGCCACTTGGCGTCAGGAGGCACCAG 
CACGTCTGAGTGTCACGCACGTTACTC 

>Contigll 

GGCCGAATGGTGAATTCATCCGTCGTCTCGAGGGGGTGAAAGACGGGGAG 

TTATGCTGTAATGGCACCGCTCACCCTGGGCTTATGAGCAGACCTAACCC 

TCCCANAGTGCTGGGATTACAGGCATGAGCCACCGTGCCCGGCCCAGTAT 

CTGAACTTCTGTGGCCAGGCAGAAAAGGTCCTGTGTTACTCGTCTCCTTT 

ATCATTCATGTCCATATTCTCCCATTTGCTAACATTTATGTTTCTGCTCC 

ACTGGATTCTTTGGATTTTTCTAGAACATACCCATGCTTTGCATTGCCTT 

GGTCTTTGAATATTTGGTCCACTTTTCCTGCAAAGTCCCCTCTCACCTTA 

TCTTCCTGGTAAACTTCCAGCCAACACCTCTTTACTAACCAGAGAAACAT 

GGTTCAACTGTGCACAGGCTTGCACAGAAACTGTTCTCATATTGTCTTGT 

CATTGTCAATGTGGCAGAGATGCACCTTAGATACCTCTTTGAGAAAGGAC 

TCACTGCCCAGCTGCCTGGCACGTGATGAGCTGATAGCTCCAGCTATAGA 

C7CCTTTAGGGTCAACCTCTGCTTTCCAGTTGAGATCATATCCTTTGCAG 

GGTGGCCTCCCCAGTGATGACTAAGGCAGTGTTACAATGGCCTAGTCATT 

TCCTCCCAATGCTGGACTCCCAATGAACCATCTGCTCCGGAGCTTCCCAC 

TGGGCAGTCAGAGACCTTAGCTAGTCTGCCTCCGAATCAGAAGGCTCTCT 
CTTGCCACTCTGGCC 

>Concigl2 

GCTGTGTCTAAAGATTCACGGCTGTAGTTCCAACTCCCGCCGCCCTCTAC 
TGTGTCCTCTTAATGGCAGTCATTCACCATCTTCCTGTCCCTCCCCTTCA 
TTTCTTGGATGGTGACTGTCACTTTGCTGCAACAGAACCCTGTCCCAATC 
CTTGATGGTTCAATACACACATAGACATTCTTTTTAACAGGGCGGCCTCT 
CAGGTCTTTAATTTTCTTCCCTCCAATAACCTTGTGATGATCCCCCAGCT 
TAGCCACTTACTGCCAGATCATTACCAGTAACTCCAGCCCCTCCTTAATT 
CTAGTTTCTAATATCCTAATCTGTGACCTCACATTCCAACTTCTTCATTC 
TTATCCCCTGAGTCAAAAAATCCTTTGATCCATGCAATCCATTAAGTCAT 
CTACCTTTTCZACCATTCTTCGCCCCACTAGGGTTCTCATTCCTTTATTAC 
CCATATGAAATTCCAAGGCCTGTTGGAATCACTCCCTTGCAGCCACTGTC 
AATACTTCTGCCCCTTTTACTTCATCACCCTTATGTGGCAAAACCACAGC 
CCTGGTGGAGTCGATCCTTACCCCTGCTCTGTGCCAACAGCCGCACACGC 
ATGGCTGATGGAGGTTGGAAAAATCCACACATGCAGTGGGCCCTGTATGT 
CCATATACGTATCCAACCTCCAGCCTTGCATATGCCTCAGTGCTGCCTGA 
CAACACATTATATGTTTTCCTTAGTTCCTTCAGTCTCCTGGGTGCCTAGG 
TGAGTATCTCAGACATCCTTCTCTCTCTGCAAAGCTCCAACACCTCCACG 
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TCACATTCAACTGATGACTGTGTCTCCTATGTC^CTTAGATCAd^GAGGC 
ATACATAAACAAATCCCAGCCACTGCCAGCACTCTGCACATCTGCGAGCA 
TGGCACCCCCAATCTAGGCCTTTCC7GCTGTCACTTGGGGTGAGCTGATT 
ATACTCGATCCTAGTCATTTCTACTTATGCAC 

>Contigl3 

CTTAAGGCCTCCCTCTAACATTTTAATTTAAGATTGAAAAAGCAAAGATT 

ATXCTGTTTTGGC7GCGCCTATAGTAAAGTAACCCCTATGNCAAATTTTG 

ACACCTTATAGTATTTGACAGGGATAAGTATAAAATTGCTTGATTGATAC 

ATCCACACCCAAATGTATGCTGGGAATGATTTTGTTTCACGGCACTCATT 

ACTTAATTTTTAAAACTCTTATTTAAATTTGCAATGTTTTAAATGACCAT 

CACTTAAAGTAGTAATCAACAGAGGTTAGGAGAACATAACAATACTCTTT 

CTCTTAGAAAATACAACAGAAATATAATTTTTTACAGTTTTGCTCCCAAA 

CTTTTCTCTGTAATAACATGCCTTACTCACCTTTACAATAGGTTTGTTGT 

GAGAATCTTGTAATGTAAACCCTGGGTGTTCTGTGAAGCATTTTTAAACT 

TCTAGTTTACACTGACTCTTATTCAAGTGTTTTTAAAAATATATTTAAAA 

AACTGGCCAGGTGCAGTGGCTCACACCTGTAATCCCAGCACTXTGGGAGG 

CCAAGGCGGGCAGATCACAAGGTCAGGAGTTTGAGACCAGCCTAGCCAAC 

ATAGTAAAACCTCGTCTCTACTAAAAATACAAAAATTAGCTGGGCGTGGT 

GGCGGGCGCCTGTAGTCCCAGCTACTCAGGAGGCTGAGGCAGAAGAATCG 

CTTGAACCCGGGAGGCAGAGGTTGTGGTGAACCAAGTTTGCGCCAATGCA 

CTGCCAGCCTCTGCAGNGACAGCC 

>Concigl4 

GGGGGCGGGCCGAGTGATCCTAAAGCCCGCTCGCTTCACAACAAAGCCTA 
ACAGTCCAATCACTTAATGCTGCATTTATTCCTGGGGAAGCAAGTCTCCT 
TTGCACTTTACACAGTGAGATAATCAGTTTCTCATGTGGACCACTGGGCC 
AGGAGGGCCTGACAAAGGGCAGTCTACATTTCAGACTGGAAACTGCTCCC 
AGAACTATTTCTTTCTAGTTCCCACCTCGGTCTGAGGTGCCTGAGGAGAG 
GGACTCAACAGAGGAAGCAGGAGCATAGCTCAAAGTGTCAGAACATGGAA 
GAGGAAAAGAATCCTCACAAGATTACGTAACTTACAGGCGTGTTGCTGCT 
TCAGTAGAAGTTTCATCTCCCTCAATCCTGTACACTTTTCCATACATTAC 
ATACTCAAACTGGTCAGCCCTATGGAGCAATAGCAGCAAAGTTATTCTTA 
ACAGTAATT AACAATATAAAAGAT C C CATTTAAAAATGGTTACTGGTCAG 
CCGGGCGTGGTNNNTCNANCCTNTAACCCCANCACTTTGGAAAGCATGCG 
GGCGATCCCAAGTCTGATATCGAAACATCTGCCTAACATGTGCAACCCCT 
CTCTACAAAATACAAAAAATATCCGGGCTTGTGTTGGCGCCGTTATCTCA 
CTACCCGGAGCTAAGTAAGAAATGCTTTACCTGGAAGCGATTTTTTTACT 
TATATCCCCTCTCTTCACCGGGCGCGACCAAATTCTTTAGTATAGGAAAG 
TTTATTGTTTTATGCCTTTGTCAAGGCTCTACTG7ATCTTTTCTGTCCAC 
TCAC 

>Concigl5 

GGTTCTGAACAACAGCAGGCGATTCCTAGCCCTGTACCCGGGGCATTGTC 

CAACACTCGACAGGGCTGAATTCGTCCATAACGGTGTGCCCCTCTGGGAT 

ATAGGATGAAATGAATTGATCTGAGTACCTGGGATGTAAAGTTACTAAAA 

CGCCAGCTAGGTTCACGCCCCGATGCTTAAATATGATCGTGGCCTACACC 

TCGTCCAGCAGAAAAAGTACCCTTTCTTCAACACCACCTCACGATCCTCC 

AATTTAGGAGCTATAAAACTCATGACTCTTTATTTACCCCCTGCAGATTC 

TCAATCCAATAGTGTGTGTCTCCCTGTGAACTCACGGATATACCGATTTT 

CCCCACGTCATTTCCACACGTCGCAATCGCTTAGTCATCCCTATGTATGA 

GAATCATGGATGACTATGTTGAAGTCCATCTATAAAGTTCAACCCCCATC 

TCCGTCCCTGATTCCCCCTCCCCAAGATCACCAACGCGACTCGACATATT 

GTTATCGCCCAAGGGACCTCTTGCATCCCCCATATCCACTGGTCACCTCC 

CCTCTTGGCTGGAAGTCACCGGGAAGTTCTCCACATGTTGT 

>Contigl6 

TGCGAGCGATGTTCCTAAACTTTAGCGCCATTGACTCGAGCATGGTCATG 

GCTGTTTCCTG 

>Concigl7 

AGGGTGTTCCTAAAGGATACTACGTTCCCTAAAGTCCAGAGAAAAAAAAA 
AAAGTAACATAATGTGGCTTATTTGGTATAAAAATTTTACAGGAAGCATT 
GTCAAATATGAAATAGTGTTTGGTTTTGTTTGGGCTGTATTTGTATAAAT 
ATGTTATTGGTATGTGTTCCAAAATTATAGGAAACTCCTATAATTCTGAT 
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ATGACTTGGTGTACATTATCAGTAATAATTATAATTGTTATGGTAAATTA 

TTGTGTGCCATGGAGGTAACAAATTTCCTCATCAAGTGTGTCTTTGACTA 

TGGTTGCCCTAAAACTTTTTGCCATTCACAGACAATTGTCTTGCTTTGGT 

CCTCTTTAGAAGGTGGTtTTATAATCAGCTATAAAACTCTAACGGGTGCT 

CTTGAATGCAGGCTTAAGATAGCTTTGGAGACTGTGACATCAGAATAGAG 

GAAAAACTTTCAGTATTCATGGAGTGCTGAAATATTCATGAATATCAAGC 

AAAACAGGAATTAACTTCATAGATGGAACTAAAAGAATGCTGAAGTAATC 

TTTTTGACTTTTTTTCTTAAAATGTTGATCCTTCGTTTTGTTTTTCAGAG 

TCAAGGAAATTTTTCTGTTGAGATATTGACAGCTTTTAACAATTAAGTAT 

ACTCCAGTGAACACAATTTGGAGCATATTTGTGTCTCTCTATATATATTT 

GGAAACAATNTTTGAGTATTCTTAACTTATTGCAATATT 
>Contigl8 

GGTTGTCTGCTATACCAGTAATGGGATTGCTGGGTCAAATGGTATTTCTG 

GTTCCAGATCCTTGAGGAATTGCCACACTGTCTTCCACAATGGTTGAACT 

AACTGACACTCCCACCAACAGTGTAAAAGCATTCCTATTTCTCCACATCC 

TCTCC^GCATCTGTTGTTTCCTGACTTTTTAATAATCGCCATTCTAACTG 

GCATGAGATGGTATCTCATTGTGGTTTCAATTTGCATTTCTCTAATGACC 

AGTGATGATGAGCTTTTTTTCATGTTTGTTGGCCACATAAATGTCTTCTT 

CTGAGATGTGTCTGTTCATATCTTTTGCCCACTTTTTGATGGGTTTTTTT 

TTCTTGCAAATTTGTTTAAATTCCTTGTAGATTCTGGATATTAGCCCTTT 

GTCAGATGGATAGATTGAAAAAATTTTCTCCTATTCTGTAGGTTGCCTGT 

TCACTCTGACAATAGTTTCTTTTGCTGTGCAGAAGCTTTTCAGTTTAATT 

AGATCCCATTTGTCAATTGGCTTTTGTTGCAATTGCTTTTGGTGTTCTAA 

TCATGAAGTCTTTGCTCATGCCTATGTCCTGAATGGTATTGCCTAGGTTT 

TCTTCTATGGTTTTTATGGTTTTAGGTCTTATGTTTAAATCCTTCTTTTT 

TTXTTTTTTTTTTTTTTGAGATGGAGTCTTAGTCTGTTGCCCAGGCTGGA 

GAGCGAGTGGCGTGTCTNTAGGACGC 

>Contigl9 

GCATGTTGTCTAAAGGTTTGTCTTCCTCCAAAATTCATATGTTAAAACCT 

AGCCCCAAATGTGATAATATTTGGAGGAAGGCTCTTTGGGAGGCAGAGCC 

CTCATGAATGGGATTAGTAGCCTTATAAAAGAGACCCCTGAGGGCTCCCT 

TGTCCCCTCCACCGTGTAAGGATGCAACAAGAAAGTATGGTCTATGATCC 

AAAAAGCAGACCCTTGCCAGGTACCCAATATGCTGGCACTTGAAGTTCCC 

AGCCTCCAGAACTGTGAGAAATAAATTTCTATTTTTCATAAGCCACCGAG 

TCTATGG7ATTTTGTTATAGGAGCACAAACAGACTGATGTGCCACCCAAC 

CATGATTATACGTGTAATTTATGGTTTCTCTGCTAGTAGGGATGCACCAT 

GGGGTTAGGAACCACGCTTTTCTTATTTCCCACACAGTCCTTAGCTCTAA 

GCATGTTCCTGAATCAAAGATCCCCATCTTTTATGAATGAAGGAGTCAGT 

GAATGAATTAATGAAAGAACTGATAACCCTCAATAATTATTCCAGCCTTT 

TATACCTACTATTAA 

>Contig20 

ACGGTTCTCTAAAGACTTTCAAGAGCTGGATTTTATGCTTTAGGTGAAGG 
TGATAAAGTAAAGTGCTTTCACTGTGGAGGGGGGCTAACTGATTGGAAGC 
CCAGCGAAGACCCTTGGGAACAACATGATAAATGGCATCCAGGGTGTAAA 
TATCTGTTAGAACAGAAGACACGAAAATATATAAACAATATTCATTTATC 
CCATTCACTTGAGGAGTGTCTGGTAAGAACTGCTGAAAAAACGCCATCAC 
TAACTAGAAAAATTGATACCATCTTCCATAATCCTATGGTACAAGAAGCT 
ATATGAATGGGGTTCAGTTTCAAAGACATTAAGAAAATAATGGAGGAAAA 
AATTCAGACATCTGGGAGCAACTGTAAATCACTTGAGGTTCTGATTGCAG 
ATCCAGTGAAGGCTCAGAAAGACAGTACACAAGACGAATCAAGTCAGACT 
TCATTGCAGAAAGAGATTAGTACTGAAGAGCAGCTAAGACACCTGCAAGA 
GGAGAAGCTTTGCAAAATCTGTATGGATAGAAATATTGCTGTCGTTTTTA 
TTCCTTGTGGACATCCAGTCACTCGTAAACAATGTGCTGAAGTGGTTGAC 
AAATGTCTCAAGTGGTACGCAGTCATTACTTTCAAGCAAAAAAATTTTAT 
GTCTTAATCTAACGCTATAGTAGGCATATTATGTTCGTATTATCCTGATT 
GAATGTGTGATGTGAACTGACTTTAAGTAATCAGGATTGAATTCCATTAG 
CATTTGG T AC CAAG TAGGAAAAAAAAAT GTAAAG CCAGTG C TTAGACACA 
GC 

>Concig21 

CGCTGTCTTAAGAACTGGGCTAGGAGTGAGCAGTGAGCCAAGATCGCACC 
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ATTGCACTTCAGCCTGGGCAACAAGAGCAA^ 

CATATATATATATGACCCATAAAAAGGAGATAAATCAACACTTCAGAACT 

GACCCAAACTTGCAAAGATACTATAATTAACAGAAAAGGACAGTTTACTA 

AGTACTCCGTATGTTCAACAAGTGAAAGATTAAACATATTAAGTAGAGAT 

GTAGAAGATATAAGAAGATCCAAAATGAACTTTTAGAGTTGAAAACTACA 

ATATTTAAGATAAAAATACACTAGGTGGGATTAAAAGTAGATTACACATT 

GC51TAAGATAAAAAAAAATGAGCCTGAATACAGCACAGTATAAACTATCT 

TAAACAAAAACACAGAGAGAAAAAATAACTTTAGAGACTTAGCTCTTATC 

CTCTATTTGTTTCTAAACAGAGGATAAGGGGCAGAAAAAATGTTTGAAGA 

AATCATGATTTTTAAATTTCCAACTGAGATAGGAATAGCACTGGGTAGTC 

ACAGGAGGCTGGAAAGACCCAAACAGCAGTTAAAACAGGAACTAGGCAAA 

GAAACCAAAGGATAACAGTAAACCTAAACTAAGGGAGAGAAAACTGACAA 

AAGCTGACTTAGGATAACTGAC 

>Contig22 

CCTGAATATAAGCCGCAAGTAACCAATTAAATTTGTTTTCCAAAATTGTA 
TTAACAATCTATGAAATTTTTATCTTGACCATAGCTATAACTTCCAGAAG 
CCTTTTATAACCTCTATAACGTTTATTAAGGAGTAGGTTAATGCTTCAAG 
AAAACCTTGTTAATCTGACACAGGACCCATATGCTGATCTTGCATCAGTG 
TGGCTTGGACATCAATGATTATGATTAATTTATAGAGAAATTGAACTTAT 
TTTATCTCTCAAAATTGGCCCTTACAATCTCACACACCCACCTCTTCCAC 
TATAGTTCCTGGGCCTTGAGTTGAATAGCTTTAATTTCTGGCTCITCjTGTT 
TCAAGAATGCAGTTTATTTTGATTGGCATTTTCTACCAGTCCTGAAGATG 
AACCTTTAATTGCTGTCAGTATTTAAGATTTAGCAGGACTTGTCCTTTTA 
AGAACCAGGAGTCAAGCCCTATAACTCAATG7CACAAGGACTTTAAAAGC 
ACATACATAAAGATATATGGATGTAATAATCATAATTTTTAAAAAATTGT 
ATTAATCTCAGTGTTTTCTAAGCAAACCAAAACTTAATAATAATGGCATA 
GAAATTATTTCAATAAAACATAAAATCTGTTAAGCCAGTTACCAAAAGGC 
AAAAGAAAAGACCTTCTGCAATGCACAGAATATTATGTTGGAAGAAAACA 
TTTCCTTTAGACCTTTAAGAAAACATTGTTAGCATCAGGACACAACAAAC 
AGAATCTGAGGGTAAAAAACGTATATGAGCTGAAGGGAGTTGAAGGAGGG 
CATTACT ATTTC C C AC C C TTTTAAAGGGGAG AGAAAAC CTAAAACAGCAA 

GATGCAATAAAAGCTGAACTTTGGGTTAAAAAAAAATTCTTAAGTCTCTT 
ATAATTTATTAAGAGTGAATCAACCCCGTAAGAAAATTTCATTGTTCTAA 
CCAATTTTTTAATATATAAGTAGTTTTTTAACATCAACCCAATCTCTAGA 
AAGACCATTATAATTTCCCTTTAATTATAGACAACTTTATCATATAAAAG 
TTTTTTTAAATAAATCCTCTTATTGTGACTTACACAGACTATTCATGACA 
TGCTTGGACTTTCTGGTTTGTCGTGAACATCCTTTTCTTTCTTTCTTTCT 
TTTTTAAATTTTACTTTACGTTCTGGGATACATGTGAAGAACATGGAGGT 
TTATTACG7AGGTGTACATGTGCCATGGTGGTTTGCTGCACCCATTAACC 
CGTCATCTATATTAGGTATTTTTCCTAATGTTATCCCTCCCCTTGCCCCC 
CACCTCCTGACAGGCCCTGGTGTGGGACATCCCCTCCCTGTGTCCATGTG 

TTCTCAATGTTCACTCCCACTTATGATTGAGAACTGCAGTGTTTGGTTTT 
CTGTTC 

>Contig23 

GCTAAATATAAGCTATGATAAAACAGTTGGCCCTCTGTATCATGGGTTTC 
ACAACTGTGGATTCAACTAACTGTGGATGAAAAATACTTGGGAAAAAAAG 
AATGGCTGCATCTGTACTGCACAAGTGCGTGCTTTTATTCTCGTCATTAT 
TCCCTAAGCAATACAATATAACAACTATTTATATAGCATTTACGCTGTAT 
TAGGTATTATAAGTAATCTAGAGATGATTTGAAGTATACAGGAGGATGTG 
CTTAGGTTACATGCAAATATTATGCCACTTTATATAAGGCCCTTGAGCCT 
CCTCAGATTTTGGTATCCATGGCAGTCCTGGAGTCAATTCTCCTGCAACA 
TCTCCATTTG TTCA GATTCTCTTCTATATCATGTTTATATCAGAAAATCT 
ACATAAGATTTTTTAATGTGTTCATATAGGTTTTGTGTATTTTTGGTTGT 
TAATCCCTAGATATATGCAGTATTTATTGCTATTATGAGTAGTGTTTCTT 
TACCATGTATTCTAGTTGGTTATTGCTGACAGAGAAATGTTGCTGGTGTT 
TCTAAGTTACCTTGTTTCTAACAACCTTGCTGAACTCTTATTAGTTCTCA 
TAGTTTTTAATTAATCTTTCTTAGTTCTGATAACATAATCTGCAAATAAT 
GACAATTTTATATCTTTCTTTCCAATGCTTATATCTCTCAGTCCTCTTTA 
TCCCAAAGTATTTTCCAGGATCTCCACTATAACATTAAATAGTAATAAGA 
ATTTCTGTCTTGTTACTGATCTTAAGGAGAATAAATTTAAATTTCCTCTG 
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TCAGGTTTTATGCTTGATATAGATTTGTGAT^^ 

AAAAAAAAATGCTTTCCZTAGTAGTCCTAATTTTTTAAAAAAATCATCATA 

AATAGATGTTGAACATTATCAAATGCTTTTTCTGCATCTATAGAGATAAT 

CATATGGTTTTTTACTATTTATTAATGTAATGAATTAGACCAATTTTCTA 

ATGCCAACTCTTTCTTGTATTTGTAGGGTAAATCCTATGGGATCATAAAA 

TACTTTTAATACATTGTTAGATTTGAAGAGTTAACGCCTTATTTAGAACG 

TTTTCAGTCACATCCATAAGTGAAATGGCACTATAGTGTCTATTACTATT 

ATATTTTTCTGGTTCTGAAACCAAAATTATACTCACCTCATACAGTAAGT 

TGGGCAACTTTTGTTCTTTTTTTCTGAAACAATTTGTGTATAGAAGAAAT 

TAACTGTTCCTTGAAAGTTTGATAATAATCATCCAGAAAATTATCCCCAT 

CTAGGGCTTTTACAAAAAGGAGACTCTAGAATGCCATTTCGGTTTCCTTG 

ATGTGTATTGGCCTCTTTCATTTAGGCTTTTGGATTTTTTAGGGCATTTT 

TTCACTATAGGCTTTTTACCGG 

>Contig24 

CATAAACTTCAGGTTGGATGTTCGGTCAAAGTGGTCCGGCGATGCGAAAA 

CGAGAGGGCTCGAGGACTGGGCAGAGAACTATTTGAAGGTATCTCTCAGG 

GGAAACCAAGCGGAAGGCGGGGAGTAAAATTGGGAGGGAGCGACGGCCTT 

CAAAGAAGGGGCTTGCATTAGATCGGCGAGATCCGGGAGGGTCTGGTGGG 

GAGAAATGACTAGAGGACAAATCTAATGGAGAGACAGACGGAGATAGATA 

TCGTGACAGAGAGAGGGACAGTGACAGCGCACAACAGTGCAGGGTCCATG 

AGTACAAGGCCCTTAAGTGTACACCCCAGCCGGAGTCATGGCAATTCGAT 

TCCTGTACTGACCACCCAGGATTTGGGTAGACTGTACGAGTTAATGAGCA 

TGGTCCCCAACAAGACTGCTTCGACCTCAGATGCAAAGCACACTTCAGGG 

GTCCCCAAGCCACTCATGTTTTTTGAATGACTGCCATAAGTTCAAAAATT 

CCCACAATTCTCTCAGATTCAATAACTGGGTATAACCACTCATAGAACTC 

AAGAAAATGCTATCATTATTATTACAATTTTATTATAAAGGATACAAATC 

AGAAGGA C T AG C CAAATGAGGAG ACACATAG AGAGAGGAC T AGT AAAAAA 

CAGAGCTTCTGCGTCCTACCTTCAAGGAATCAGGATGCACCACCCTCCCA 

GC^CATCAAGTGCTCATCAACCAGGAAGTTCCTCTGAGCTCCAATGTCCA 

GAGATTTTAGGGAGGATTCATTACATAGGTATCATTGATTAAATCATTGG 

CCATGTACTTGAACTCAATCTCCAGTGTCCCTCTTCTCCCTAGAGGTCTG 

AAGGGTTGGCTAATATCATGTGGCTCAAAGCCCCAACTCTAATTACCTTT 

TTGGTCTTTTCAGGGACTAGACCCCATCCTGAAGCTATCTACAGGCCCTG 

CCATGAGTTAGCTCATTAACATAACAAAGACACTTATATTACTCAGAAAA 

TTCCAACAGTTTTAGAAGCTCCATGTCAGGAACCTGGGACATAGATCAAA 

TTC Trrrr TTTTTTTT TT TTTTGGAGACAGGGTCTTGCTGTGTTGCCCAG 

GCTAGAGTGCAACGACAGATCACAGCTCAATGCAGCTTCAACTTCCCAGG 

CTTAAGTGACCTTTCCACCTTAACCTTCCAAGTATCTGGGACCACAGAAA 

ATGGCTAATTATCCTGGCTGATTTTTAAACTTTTTTTTTTTGTAGGGATG 

GGATCX5CCCTGTGTTGCCAAGGTTGGTCTCAAACTCCTGGGTTCAAGCAA 

TCATTCTGCCCTGGCCrrCTGTGATGGTTAATACTGAGTGTCAACTTGATT 

GGATTGAAGGATACAAAATAATATTTTTGGGTGTGTCTGTGAAGGTTTCG 

CCAAAAGACATTACTTTGAGTCAGTGGACGGGGAAATCCCCCCTTCCCCA 

TGGGACGGGGAGACCCCCCTCCATCCAGGTAAAAAAATCTAATCACCTGC 

AATGTGGCAGAAATAAAGGAGGGAAAAAACGGGGACCCCTANATGGGTTA 

TTCTCCACCTAATTCTTCCCCCAGG 

>Contig25 

CCATGT^TTTCIATTTCTACAGACCCTGAGATGAATTTGTCATTGCCACGG 
GGTCCTGAAGTTCAAATACTCTATTTGGTATCCTGCCCCTGTGGTTAACT 
GTGATCATTTCACTCACCTTGTTTATGATGAGAGGTGCCACCATCTGGCC 
TCCTCCACTCTGCAATCCTGTTAATTCCTATCAAAGCTGAAAACCTGCTG 
CAGCACCCACACCATCACCTCCAGCCTAGAGAGGGAAGCTACCAGTGAGC 
TCTCCTGGATGCCGGTGTGCCCCTCGCCAATACATTTCTTCTTAGTCCCT 
TGGTCATCCTGAGGTGTGTGATTAATGGACAGCTATGTGGATTGCACATA 
ATAGATGTACTCCAGCATCTTCATCCCTGATTTTCCTTTACAGAAATCAC 
TCAACCZTTAGCAACZATGTGAAAATCACCTAAGGACATTCTTTAAATCCCT 
CTGTCCACATGGCAACACAAACCACTTAAATAAGAATCTCCAGGGAGTCA 
CTCAAGCATCAATGTTTTTTAAAGCTCCAATTTTAAGGATCZATTACATTA 
TGTCGAAGAAATTATAGTATTTCZAGCCTTACTGACTGTAAACCACCACCA 
TATCTAAGCATCCATTAGTCAACCTAGCAGACAATAAACTAACATTACCT 
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CCAGGTACTO^TCAATTC^TTGCATCC^ ~ 

CTTATTGACAAATTCAGCCCAATCTTGGTTGAACACATTTAGAATATATT 

TCCATGAACAATATCCGGTTGACGAGTTTCTTTAACTTTTTGGAGTTTAA' 

GCCATTTCCTTTCACAGTAGCCTTGTTAATTCCCTGTCAATGCTCCATGG 

GGGTCATGAAGAGACCTCTTATTAACTGTGAAGCAACTTGGCTCAGGTGC 

AGACACTCAAATGCTTCACATGCAGTGGGAAAAGAGAGTGATTGTCTAC 
>Gontig26 

TTTAAAAAGAACTGAGTCTTTATTCAGTCGATTCTTCTAATCTATGAACA 

TAGCATCTCTCTCAAAGCATTTAGTCCTTCTTTAATTTCTGTCATTAATT 

TTTTAAAATTTTCATCCTAAAGATTCTGTATATGTTTTGTTGAATTTATG 

CTTAAGCATTTCACTTTCTTGGTAACAATTATAAATGATTTTGTGTTTTT 

TATTCCACTAGTTCATTTTCAGTGTGTAGAAAAGCAATGAATTTTTGTGT 

GTTGATCTTTGTTCCAACATCTTGCAACATTATTGAACTCATTTATTAGT 

TCTAGGAGGTTTTTTCATTTTTCTTGTAGATACCTTGAGATTTTCTATAT 

AGACAGTCATGTTGTCTGCAAACAGGCACAGTTTTATTTCTTCCTTTTCA 

ATCTATATGCCTTTTTTTTTTTTTTTGCCTTATTGCAGTGGGTAGAACTT 

CTAGCACTATGTCAAATAGCATTGGTGAAAGCAGACATCCTTGTTCCTTG 

TCTTAGAGGAACATTTGGTCTTTAATCTTGATTTAAAAAATTCCTTGCAC 

TAAGTTACCGTGTTTTGCGGGAGGGAGAGGTGGGGTGAGGTGGGGATTTC 

CCCTAATGTTTACAAGCTGGGATTTTCTTTTTCCTGTGTCTAATTATTTT 

CCTCATTGGC7TGAAAAATCTGATAAAACATTTTAGGACTGTGTATAAAA 

TAGAATTAGCCAAGTGCAATGTCTTTATTCAGAAGAAATTTCATGGACGT 

TGTGCCTACTCTCTTGGCTTCCTGGCTTCATGGCTTTCCAGATCCCACAG 

TAAGCTCTGGATAGTAGAAGTTATAGTAAGACTGACTTCTAAATAAATGA 

AGTGACTTTAACCTTACTGATATGGCTTAAAGAAAAGGAGTGGCCTTTAA 

GATCCATGAACTTCTCAAACAAAAGTGATAACGTTATCTCCATGCATATA 

TAATACTAAATATAATGCAACTGAGAGAAGTAGGCTGTGGTAAGAAAGGA 

GACCCAAGTGCCATCTGAAGGCAGCACTTACCACTCTGCTTCATCCCACC 

GAGGAAACAAAGCATGAGTATTGCCAGATTTTCTTCTGTTTCAAGAAAAG 

CCAGAAATCCAGGTTTTTGCGTGAAATGTCCTGATTTTAATGTTGGGAAC 

TAATTTATATTTTGAAATAACATTGTGTGGGACAAGTGAACTTGTATGTG 

GAACTGCTTTCTCCCAGTGGCGACCAGTTTGGACCGTTGATACTCAGCAA 

GTTCAGCCAAGTGCGCCTTGTCATTGTCAGTCATCAAGGTGATGTGTGAT 

TGGTCAAGCAATTAATTTTGCTCAGCATCTCGTGTGTTTTCAAAAGAACT 

GAAGGTTCATTTGC 

>Contig27 

TTTCAGAGCACAATGCGTATTCATAGTATATTGACTTAATTTCTAAGTGT 
AAG7GAATTAATCATCTGAATTTTTTATTTTCAGATAGGCTTAACAAATA 
GAACATTCTGTATATAAATGTGTAAATTAGAGTTAATCTTTCCAATCACA 
TAATTCGTTTTATGTGAAAAAGGAATGAACTGTTCCATGCTGGTGGAAAG 
ATAGAGATTATTTTTAGAGGTTTGTCGTTGTGTTTTGGGATTCTGTTTTC 
TTTTAAAATTGTAAATATGTACTTGTGTGAATGATTTTTTAAAATGATTT 
TACCATTTTTGGAAGGGTATTTAATGATAGAATATCATCGAGCCAACATG 
CACTGACATAGAAAGATGTCAAAGATATATTAAGTGTAAAATGCAAGAGG 
GAAAACACTATGTACAGTCTGAGCCAAATCAAAGCATGTATGTTTTTTAT 
ATGTGTACAACAAAAGGTTTGGAAAGATATGCGCCGAATTGTTAAATGTG 
GTTTCACTTGAGGGGGTGGGAGGATGGGGCCCCAGAGGGGTTTTTATGGG 
GGCCTTTCACTTGGTATTTTTTTCATTTTGTTCTGTTTGAAATTTTGTTT 
TTTCTTTTTAAATGGAGTTTCACTCTTGTCGCCTAGGCTGCAATGTAGTG 
GCGTGAACTCAGCTCACTGCAACCTCCGCCTCCCAGGTTCAAGTGATTCT 
CCTGCCTCAGCCTCCCATGCCTCCTGTGTAGCTGGGATTACAGGCACCCA 
TCACCATGCCTGGCTAATTTTTGTATTTTCAGTAGAGATGGGGTTTCACC 
ATGTTGGCCAGGCrGGTCTGTAATTCCTGACCTCAAGTGATCCACCCACC 
TTGGCCTCCCAAAGTGCTGGGATTTCAGGTGTGAGCCACCACGCCCAGCC 
CTGTTTAAATTTTTTATAAGTATGTACTACTTTTGTAATCAGAATTATTA 
GAAAGCATTTTACTGATTTAAAAGCTTAGACATGTTCAAATGCCTGCAAA 
ACTACTTAACACTCAGCTTTAGTTTTTCTAATCCAAAAAGGCCGGGCAGT 
TAATCTTTTTGGTGCCAATGTGAAATTTAAACGGTTTTATGTTTTTCCTG 
TGTTGTGAATGAAAAATATTTCTGAGTGGTGGTTTTTTGACAGGTAGACC 
ATGTCTTGTCTTGTTTCAAAATAAGTATTTCTGATTTTGTAAAATGAAAT 
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ATACAATATGTCACAGATCTTCCAATTAAGTAGT 

TCCTTGCTAATTTAAGCTTGCATAAGTCACTTTACTAAAAGATCTTTGTT 

AAGCTAGTATTTTAAACATCTGTCAGCTTATGTAGGTAAAAGTAGAAGCA 

TGTTTGTACACTGTTGTAGTTATAGTGACAGCTTTCCATGTTGAGGTTCT 

CATATCACCTTGTATCTTGAAGTTTCATGTGAGTTTTTACCATTAGGATG 

ATTAAGATGTATATAGGACAAAATATTAAGTCTTTCCTTTACCTAAGTTT 

GCTTTCTTGACTAGTAATAGTAGTAGATATTTCTGTAATAAATGTTCTCT 

CAAGATCCTTAAAATCTCTTGGAAATTATAAAATTATTGGAAAGACAAGA 

ACAGTTTTTATTCATTATATGCATTATTATCG 

>Contig28 

CTTTCTCAAGAAAAGGGAACTGGAGCAATTAAACATATGTAATTTTTTTT 
TAAAAAACCCTAAACCTAAACATCTACCTATATACAAAAATTAATTAACA 
ATGGATCATGGACTCCAATGTAAAACATGAAACTCTAAACTTCTAGAAAA 
AAAACTGGAGAAAACCTTTGGTACCTATGACAAGGCACAGTTTTTAGACT 
TAACACTAGAAGTGTGAACTATACAAGAAAAAATTAATAATTTGAACCTT 
ATGAAAATCAAATTATTTGCTCTCCAAAAGACCCTGTTAAGAGGATGAAA 
ACTAAATTACAGATTGAGAGAAAATATTTGTAAATCACATATTTGACAAT 
GGACTTGTATCTAAAATATCTAAAGAACTCTCAAAACTCAACATTAAAAA 
AAATATCTAATTAGAAAATGAGTGAACATTTTACGAAAGGGGCCTTATAG 
ATTAGCAAATAAAACACTTGAAAAGATACTCAGCATCACTAGCCATTAGA 
AAAATGCATATTAAAACCACAATAATGTATCGCTACACACATATAAGAAT 
GGTTTATGAAAAAATAGTGATGACACCAACTGTTAGTGAAGATGTGGAGA 
AACACTCATACATTGCTGGTAGAAATGTAAAATGGCATAGCCACTGTGGA 
AAATTATTTGGCAGTTCCTTTTAAAACTAAAAATCAATCTACCACACAAC 
CCAGCAATTTCATTACAGGGCATATATCCCAGAGAAATGAAGATTTATGA 
TCACACAAAAATCTGTACACAAATGTTTTATGGTCACTTTATTCATAATA 
GCCAAAACCTGGAAACTATCCAAATGTCCTTCAATGGGCAAAGGATTAAA 
CACACTGTGATACATCCATACCATGGAATACTACTCAGCAATAATAAGGA 
AAGAATTACTGCTACACACAAGTTGGATTAAACTCAAGGAAATTGTGCTG 
AGTGAAAAATTAACAAGCCyiATCTCAAAGGACACATAGTTCATGATTCCA 
TTTGTATAACATTAATTAACACAATTAATTACAGAGATGGAGAACAGAAT 
AGTGGTTGCCAGGGATTATACATGGTGGACGCGGTGAGGCGGGCCTCCAC 
GCCTTGGAGATGAAGGGGGCTACACCCTTTAAAGCACACCCACGAGAGAG 
TTTTGTGCGGAGGGGCCCAATTTAAGTACTCCGCCCCGGGGGGGGAACAC 
AGGGGCAAACAAAAAAAATTGGCCTTGGGGGTGACCAAACACACAAAAAA 
AAAACAAACACACAAAAAAACAACNATGGGTGGGAGGATTAATCGCCAAA 
TCTGAGTAAGCTATCTGGACAGTACCAATATCGATTTCCCAGTTTTGATG 
T7GTACTA7AATAATGCAAGATGTTAACATTGGAAGAAGCTGGCTGAAGG 
GGGCTCAGGAACTCTCZTGGACATTTCTTTGTACCTTCCTGTGAATCCATC 
ATTATTACAAAATAGGACATTTTCTAAAGGTTAAATCATTTTAATTTTAA 
AATGTCCCTGTTACTGTTGAAACTCACATCTCCATATACTGATCAAGAAC 
AGCACTAATGGCCCCTGGCCTCCAGGAATTCACAATTCCTACTGACTTTT 
CTTTGAAACCTTGGCCAAX3TCGCTTCTCCT 

TCTTCAAAATGAAGATTGAATGACTATTAAAATCTCTTGCAATTCTTGAG 

ATGAAGGGTCCTAAAGGAACTGAAGAGGATGCCATGTAATGTAAATATGG 

GTTTTTACTCCATCAGCCAGCCAAGACAGAGGGCAGACACCAAGACATGG 

TAACCAAGGAGGCCATGTGTAAACAAAGACCATTTAGACTTATGCTCTGG 

CCTTTGCAGCCCAACTGGTGTGGCCAGTTGGTGGGGTATGAAGAAAATGG 

GGCCTTCCAGGAACCATGTTGAGTGGAGATAAGCAGGGAGGAATGCAGAA 

GACATGGGGGCAGTGCCAGTCTCAGCCCGAGCCAGCTACACCCACACATG 

GTTATGAAAGACTGACAGCCTGTAAGNTGAACACAGCCCTGCCTCTCTTA 
GATAGGC 

>Contig29 

GCAAATATGATCTCAGATGTGGATTTACTGTAAAGTTCATCAAATTTAAA 
TTTCAGAACACTTAATCTGCAAGAGTCCTTTCCAAGACCCTATACCTAAT 
TTTGTGTTTACAATTTTATATTTGTTTTCTTAAAGAAGACCACCAATATA 
AACTATATCCAGCCTTCATGATAAGTACATAGGAAACTATGCAAATAAGG 
GGGAAAAAAAACAAAGAAAAATACCTAGTTTACTAATGGTTCACTTCTGA 
ATAGCACATATTCATAATGATACAAGCACTCATTACTAGTCTAGGAAAAT 
GAAGATATAATTGCATTAGGAAGATCAAGAGGTAGGAAATGTGGATGTGT 
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GTGGTATAGACTAGGGCAGGACAAAGAACCTAAATCCTCATTTTCTAAAG 

ATAATTGTTAATACGTAAAACTCAAAATTCAAGAAGTAACAGTAAAAGCG 

GTCATTAAGAAACAAGCACTAAACACCAGATAGGAAGCGAGAGATGGGGG 

AAGAGGGCGACAATCTGATTATTTTTTGCAACAAATTTTGTAAAACCATT 

TGACTGTTTAC^TGTAGAACTTGGATCTTTTTTAAAAAACACAAAATAAT 

AATACTATTATTTTTTAACTGGATTTTTGAAAAAGAAGATAAAAGTCTCA 

TTTTAGTAATTAAAACTCATTCCAGGTTAGTCCACTCAAAACTTATATTC 

GAAAATTAAAACTTTGGGAGGCTGAGGCAGGCAGATCACCTGAGGTTGGG 

AGTTCGAGACCAGCCTGACCAACACGGAGAAACCCCGTCTCTACTAAAAA 

TACAAAATTAGCTGGGCGTTGTGCATGCCTGTAATCCCAGCTACTCGGGA 

GGCTGAGGCAGGAGAATTGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAG 

CCGAGATCACACCATTGCACTCCAGCCTGGGCAACAAGAGTGAAACTCCA 

TCTCAAAAAAAAAAAAAAAAAAAAATTAAAACCTCTGGAAGTTGAGTTTG 

CAAATATTCATTATGCTCATTTTTAACTTGTATGTTTGGAAAATGTCATG 

ATGAAAATTGAGGTTGGGGGATGAGAAAAAAAGAAAAACATCAACCCCAC 

AGCCCATTCAATTTTCAGCCCGACCCACAGCTCCGGGGAAGGGCAGCAGG 

TCCATCCTTCACTCTTTCTTCACCTCTTTCCCCTCCTTCTGGCTCTTCCA 

CCTCTAATTTGGAGCCCAAAAAAAGGCACTGGGAAATGGAAAAGTCTTTT 

GTACGTGGTACTTGCCGGGGAAGCTGCCATGAAAACCTGGCCCCACGGTG 

GGGAGGGAATGCCCANCTGAGGCCTCGTGCCCATGCTAGGATAGACTCGT 

CC^^CATGTCAGGTGGTCTGACAGGGCAAGCANCANGAAATCATCSTATG 

AGTATGAACTGATCTGTATGCAAGGGCGGGGAGAACACGCGGAGGAATGG 

GGCGTGAGAAAACAGCACAGTACGTTTCTTTAGCAGCTGTCTCTGCTCAG 

CCATGGGAGGTCACAGAGAAAGAGGCTTGGAGGCGTTATTTTCACTGTGA 

GATGTGAGTGTAAAAAAGTGCCCAAGACACAGTGAGTACCAGGGAGATGC 

CCTCTTTCCTACCCGAATGCAGAATGGCCACAGGCCTTAAAACACACACA 

TGGGTCCTCAGAGGAGAGAGGCCTCCACAGTGGACACCCGCATTCTCCCC 

TGGTCAGCAGCAGCAGGGCGAGTGCTGGGCCATCATGAAGCTTCACAGGC 

AATGAGCTCTCAGCAATAACAGGAACAGTGCCTGGGGGACTGTAGCTGCA 

AGACCGATTTTCATGTAAGATGGCCTCTGAGGACTCCGAGATACACCAGG 

CTGAGACTAGCTGGCAGCTCCAAGTTGTTGGTCAGAAGAGAACAGGAACT 

AGGGAAATTGGAATTACTGTTACTACAATTCCTTTACATCCGCACAACCA 

TGAGGTCCAGCGATTTTCTATTATTTTTTTTTTTAAGACAGGGTCTCAGT 

ATGTCGCCCAGCATAGAGTGCATTGATGTGATCATGGTTCAGTACAGTAT 

TCACGTCCCAGGCTCAAGTGACCCTCCTGCCTCAGCCTCTCAAGTGGCTG 

GGACAGCAGTTGCATGCTACCAGGCCAGGCTTTTTTTTTTTTTTTTTTTA 

GTTTCTGTAGAGCACATAGC 

>Concig30 

GGTTAACAATGGCACAGGGAAACAAACAGTTCCAGG7GCAGGGGCTCTAA 
ATCTATCATAAGATGTTAGGTATGGGGGCTCTGCCGGACACAAACTCAAG 
GCTTTATGCTGTTATCTCTTGAGCGAAATCCTGGGAACTTCGTACATTGC 
TTGCTTCAGTACCTTATCAGTTAATCGGACTCTTTGATATGTTGGGAGTC 
AGCGTACACAAGTTAACTCCTTGAGGAAGGGGGTGGGTAAGGAGTCCTTG 
ATGTCTGGTAAATGAAGGAGCGAAATCGAGTTCCTCTGGCTTTCTCAGCT 
AAGGGAGAGCTTATTCATGTGGAAACAAGGCTAAGTGATTAAGGGAGAAA 
GGGAGAGTCTGAAAACAAGGTTAGGTATTACAATGTCAATAAAATTGGTC 
TCCTTATACAGTCCTATGGTAGATTTCTTTCCATCTTTAATCTCCCTCTA 
GCACCACCAGACTTTTTCTCTCTGTACCTTGAGATGTAAATTTTGCTATC 
TGAATTTTCGTCTAAGAGTTGTTTCCTTTAATATGCAAATTTAGGGTTAT 
TTAGCTGACAACTGCCAAAGTAGTGAAACAAGTTATCAAGAACTTGAACG 
TCTAAGGTAGGAAAAAAAAAAGTCTTTATGAATCTATAAGATGTACTTCT 
ATTGGCATGCCTAATACGTCTATGTATTTACGTGTTGTGTACACAGTTTT 
TCACTACTGAAAATATATAGAGGAGTTCTAATTAATTGACTTAAGACAAT 
AAAAGCG CTTGAATCAAATAC CTT AT CAGGAAAAAGGAAAAGACAAGTCA 
AATGCTTGTTCAAGTTTATATAACTTAAGTAAAATCTTTAATAAATAAGC 
TAGCTTTAACATTATTTGAAATGTCTTAAGAATTGCCAGCAGGTTCTGGG 
TTACAGAACTAGTGGGGGTGCAGTGGGGTGAGGGTTGGTGGGGTGGGGGG 
TGGTACGGGGGCTTTGTTTTTTCTTGCTGCCCCCTTCTGGGTTGGGGAAG 
TGGCAGGACCTTGGCAGCACCCCGAGCCGGCATGGCGTTAATAATGGAGG 
GATGCCAGACCCAAGTGGCTAAGGCCCGGCTGCAGAGCCAAGTTGGCATT 
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TCCAGACTGGGGCTCGGGCCGCACCCTCTCCAGGACCCTCCCCTfGfACCr 

GAGCAGATTGTCGCGGGCAGTTTGGGCCAGCTGTCC7GGCGTGGAATTTC 

CCAAATTCAACAAATCCTCCAAGAAATCAATCCATCCATTCATCCATCCA 

TCCATCCATCCATCCATGCATCCATCCATCCGTGGCAGATTATGAAGCAT 

GGATCATTACTTTTGGGATGTGGATATATTCAGTTAA.CAAGGAGCAGCTT 

TCAAGAGCTGGATTTTATGCTTTGGGTGAAGTTTAGAAACACTAGCTCCC 

AC* 

>Concig31 

ACCTCATGTGCTCTAGCGCCTCTTACC7CATGCCCTCCACTCTCAGTCTT 

GCACTCACCCTGCCACACTCAAGGGCTTCCCCAGGTTCCTTCTTAGATTC 

CACCGATAGCTCAGGGACTTTGCACATGCTACGGTCTCTGCCTGGCTCCT 

CCCCAGATCTTCTCATGCCTAGCTGCTTCTCATCAGCACCCCTCAGAGAC 

TGTCCCTGCCCCACCTCTCCAGGTTCCATACCTGCCACCCTCCCCCAATC 

ACGTAACAGTTTCTTCACAGAGCGAGTTACCATCCCAGTATTTCCCTAAC 

TTATTTTTTGTGACTGGTCTGTTGCCTGTCTCCACCACAAGAACATAAGC 

TGCATGTGAACAGGAGCCTTGTCTATCTTGTCACCCCAGTGGCTGTGACA 

TAACCTGATACACATTAGATGCTCAATGATGTTTGATGAATGAAGTGCTG 

GTAGTCCAACTGTGTTTCCTTGTCTGTGTAAGTATGTCTGTTGTGGTTTC 

CTAAGAACCTACAGCTCTCCCACTGTGACTCCTGTTCTATGGTCCTGATT 

TGCTGGACTAGAATCCTAACCTACATGCTTACTCTTAGTGTCCTCCCCCA 

GAGGCTGAATCCCAGTCCCTAAACCTCCACCAAATGGCTAAGACCtAGCT 

TCCAACCAGACAGGCCTACGCTGAGACCTCAGCACCGCCCTTCTGCGGTC 

TCATCCTTAACGCATCCTTCAGGGCCCAGCTTAAATGTCTCTTCTCCAAG 

GAAGGCTATCCTCTTTCTGCCCCTCAGTGCTCTCCATGCCTCCTCTATGC 

CTCCATGCCTGCTTTCCAACCCTGCAGAGGTGGAGAAGTTGCTAATCTGC 

TGTGTTGACATGTGCTGGGGTGCCTTGGGCCAGGGAGCAGGCTGGTGGTG 

TGCTGATAGCCCGTGGCTGTGCCCAGGTCCATGCTCACTTCCTGAGCCCC 

AGTGGAGTAGGCTCCCTTTCCCTTATTGCAGCACTCAGAGGAAGGACGTG 

CTTCTTAGGACAGATCTGGCCAACCTCTCCCTCGTGAGAGAAGGCCCAGC 

CATCCTCTTGCCCTCTTTCTTTCTCCTGCCCCCGAGTAATAAAGGTGCCT 

GGTCAGAGCCTTCTAGAAGGAGACCCAAACATCCACCACACATTCCCAGT 

TCCAACCGTCATCCACATGGCTGGCTGTGCAGGTAAACGCAGAGTCTGTT 

TCACACACCCAACCATCTAGTATTGGATGGGAGGACAGTAGCGTGACACT 

CTTCTCCAGCCTTGAGCCCTACTGTGGGCCCCACCCAACCCAGATACCAG 

AGGAGCCCTGTACTGGGATGCTATTGGATGCTTGTCCAGTCATGTACAAA 

GTTAGCCCTTTGTTATATAGAGTTAGCTACGTACATCTTCCTCTGTAGGG 

AACCCAAGAGGGGAGAAGAGATATGTAGTAGGATTTAACCTGCAAATCCT 

CTGCTGAGCACCGTGCACTACATACAGTGGGTAGCATGTGGTAGGTGCTC 

AATAACTATTGACCGATCTATTGAATACACGTAAGATCGTGACACTATCT 

AAAACGNGGGGT GTGGGGGAAAAAC CCCCCCCTT GTTT AGGAAAC C CAAA 

TTGGACCGTGTTGGC 

>Concig32 

GCGCGATTGTGCTAAAGATCATGCATGCCTGATCAAACGTCCCCATATGG 
CGTCTCAGAGTCAACTCCTTCCCCATCAGTGCCCTGACTTCGGCATAACA 
AACCTGGCAGGTTAAGTGATTAATCGGTCCTGTACAACTGTAGCCCTTAG 
CAGGAAGCACTAAGCITCGTTTTCATTTATTTCTTCCCTGGAACTGCZAAG 
AAATGAGGGATGCCTTCCGCCATGAAGTTTTGCTGATTGTCCACTTTGTT 
CTCAAGGAGATATTCACAGTTTTTAATTTGTCTTTCTCTCCTGCATGGTC 
TCCAAACCTGTCCAAAGAAGCCAGCTGGCTCCATCATCTGTAAAATCACC 
ATTGTCACCAGAGCACTTGACTTCCTGTTGCCCTACAATCCACCTGCACT 
TTATTTCCTGCCACCATGATAATGTAGTGTTACTACATTTTACATTCAGC 
TGTAAGAAATGTTACATTCATTTACTTAAATCAAATTAAGTCTGCTCACT 
CAGTCCCCCACAGTGACCAACTTATAAAAGAGAAGGTACATTTCAGTCAT 
CACTGAGGTTCTCTCTACCACTGGAAAACTGAGGAAGGGTCTGGAGTCCA 
CAGTGGTTAACATCATTGCCTCTGTTTTTTCTCCTACTCAATGTAACCAT 
CCAAGGTTACTCACAAATTCACAAAAAGAGGTCTTCACCTCTGCTCTCAA 
GACCCAGAGGGCTGGGTTCTAAACTCAAAGGCCAATGTTCCCCAACTTTT 
TGCATTGTTTCAACATTGGGGAAAACTCGAGGGGATTCAAGAATGGTTAT 
ATAAGTTTTGTGGAAAAATGTATAATTTTTTAAAATTAAAATACAAAGTA 
TTATGGAAAGCACTAAATATTGAATTTATATAAATATTCCAAATATTTTT 
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CTAAATTTTTAGTGAGAAACTTGAGCTTGCTTCTGTGAGATATTTATTTT 

AAAACAGATTTGACACTTAAAATGTCTAATCAAGCCTTTTAAACCATGAT 

CTATCTCTTCAAATTCTTCAGATGCCACCATCAATAAAGAAAC7TTGTTC 

ACACAAGTAAGTGGTAGCAAATGGCAGGGTGTTTATCATTTTTTTTTTTT 

CTTTTTTTGAGACGGAGTCTCGCTCTGTCGCCCAGGCTGGAGTGCAGTGG 

CGCGATCTCAGCTCACTGCAAGTTCCACCTGCTGGGTTCACGCCCTTCTC 

C76CCTCAGCCTCCCGAGTAGCTGGGACTACAGGCACCTGCCACCACGCC 

CGGCTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTCACCGTGTTAGCC 

AGGATGGTCTCGATCTCCTGACCTCGTGATCCGCCCGCCTCGGCCTCCCA 

AAGTGCTGGGATGACAGGCGTGAGCCACCGCGCCCCCGCCGTGTTTATCA 

TTTTTTGCCTGATGAAATTTTCCTTGCCACTACTCTGGATGGTTTGATAC 

ATTTAAATTGTGCTTCCAGGGTACAATTATCCTTTAAATCTATACCTCTT 

TCCTTTCTTTTATTGACAAATATAATGTTACACTTTTCTGTCATTGCAGC 

CACACCACCAGTACACAGATCCCAACAGAGTTGTAATATTTTATTAGTTT 

CAGAGTTTCAATATTTTATCACTTTCAATACTTCATGTGCAGGAGTTTTA 

TTTGGTACTTCTTTACAAAATAAATGATGTGCTTCCAAGCATtTCTTTTC 

AATAATTCCAATCAATGTTATTAACTGAGTAATACTAGTATCTGTTTATT 

CATAAATTCACAGGAAATGCTTTTTTACTTATTAGTCTTTGGAATTCTGT 

TGTTTGTATAAACATCTTTCATGATGGCTTTGTGTCTACCAATAGCACTA 

TTGCCAAAAGGCACCTTTTTCTTGTTCCTTTACTTCACTGGTCCGAAGCC 

TGGTACCAACAACTACCACACAGACTGGGAAATGAGCAATTTTGCCACGT 

GCCCTTAGCTATTAATGGTGGCACTCCATAACTAGCATCTTAAGCTCAAT 

TTCATGAAAGAAATGTGTTTCTTATTTTGTACT7GCAGGCACTTTTTAAA 

CTTGTAATCTTTTATTCATACTTTAAAATTAAAACAGAGTAATAGAACCC 

ATAGAAGGAAATCAATACCCACGAGTCCATACTGATATAAATAAATAGTT 

ACATAAATAAATGGGGGGAGAAATAACAGCTCTTCCTTACAGAAAAATTT 

CAATTAATAAATGAAGAAGGAATTAGGGAAATACAACGTTACCATTAAGC 

AACCACAGTAATAATCATTACAGGCAATATCCAAAAATAAATTCCAAAGC 

CAGTGGGCAAAAGTTTGAGGAGATACAGGATATTAACATAGTCTCCAAAT 

AGCTCATGCTATTTATAAATTACAAAAGGAAACATAACAACTGTATAGTG 

AAGAAACTCAGCAGACACCACCTTAGCCAAGTGATCAAGGTTAACGTCAC 

TAGTAATAGGGCTTGTTGACATACTGGACTCCAATCTGATACACTGATAA 

GGACACATGACTTCTGCAGTATTCTTACCAAAAACAGAATTCTAATGTAA 

TTAAGGAAAATGTCAGACAAACCTATTCTGAGAAACATTCTATAAAACAA 

CTAACCAATACTTTCAAAATTGTCAAGGTCATAAAGACCAGGCGATGGTC 

ACAGATTTGAGGAGACTAAGGAGATACAACAACTAAATACACAAATGGAA 

CCATGGCATTCTTGATTGGATCTTGAAACAGAAAAAGGATATTAGGAAGA 

AAAGCTGATGAAATTCTAATACATTCTGTAGTTTAATTAATAGTATTGTA 

CCAATATTAATTTCCTAGATTTGATCATTATACTATGGTTAAGTTTTTAA 

CATTAGAGGAATCTGGGAGAATGGTATATATGAACTCCACTGTTCATTCA 

ACTTTTTCAGTAACTATTATTTCAAAATAAAGTT 

>Contig33 

GGGAGCGGCGGCCCACGCTGATCTCTAAAGCTTTAGACCACATTGGCTCG 

AGCATGGTCATGGCCGTTTCCTG 

>Contig34 

GACGTCTTAGCGCTATATTATAAAGAAATATTCACCTCCCTGCTGAGCTT 
ACAGGGTGTACCTAATGTCCAACAATATGAAATCTCTTCAATGAATTGCA 
GCACGTCCATATATAACCCACATGGAAGCTGTCCTCTTTCCTCACCTTCG 
AACTTCCCATGCCZAAAGAGGGACCTCTTGGACTCAAATACATCTTAGCZAA 
TATAGAAGATGCTGGAGACTTGTAGGAGAAGTGGAGAGGGTTTACAGTGT 
AGCCCCACAGAAAACAACTTATGACCCCATCAGTCACTTGTCCCTTTTTT 
CCATGCCTCAGTCTAGTCAGGAAACCACTAGATCCTGGATGGCTTCTTCT 
CCCTTCCCCTCCTTTCTCTTCTCCTCTCCCTCCCTTGCTCCTCCTTCCTC 
CATCACCCACTCCTTACTTCCAACCAAAACTTGACTAGCTCCAGTCTCAT 
CCCTCCTTATTGAAAACTATTTTACTCAGCCCTCCTCCCCCACTCCTGCC 
CAATCTTTATTCCTTACCTACATCAGACTTCACCAAAACAAAGGCCAGGA 
TAATAAACAGGACAAACTCTTTCAAACACATTTTAATGACCATATTTTGT 
TATTTTGGTACAATTTGAGGAGTCCCAATCCCCAGGGAAGACTAACAAGA 
AGTTCTCCTAACAAAGGTGGGTCTCCCCTTACTAAAAACTCCTGTAATGG 
CTGAAAAGAGCATGAGGTTTTCTGCATATCATTACACATTCAATAGAACG 
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TCATGCAGCTGTTAAAAkaGATCTGTAG* ^GGCTATCTTGTGicAGAAA^ 

GCATTGGAGATATACTGTTAGTGACAAAAATAGGTTATAAATGAATTTTT 

CCATGCATGCC7CTATATTTATAAATACACACACATAAAAGACAGGAAGG 

ACAGACATTAAACATTCATAGTGCTTAAGATGATGCATAGTATAATAGTT 

AGGACCATGGCCTTTGGGACAGAAAACTACAGCCTCTCTCCCACTTATCA 

GCCATGGGACCTTGGGCAATTTGCTCAGCCTCAAAGCCCCTGTTCCTTTA 

TCTGTGTGCTGGGGTTGTTGTAAGAGTTAAGTGCAATACACAGAGAGAGA 

GAGAGTACCTAACATGTATTATGTGCTCAGTCAATATGCATCATAGTACT 

CATTGTTACATATGTTCCTAAGTGCTTTATACGTTTTTTCCCTAAGTTGA 

CCATCTGTTTTTGGCATTATGAAACATAATGATCCTAACAAATTAAAATT 

AAAAACATAAAGAATATTTGCCCCAAAAAAATAAAGAACATGAATTCTTC 

AAGTAGCCAAGGGGCCATAGACAGAAGTAAGCCCTTGGTGGGGCTTAGTT 

GAGAGAAGTCTCCAGAAGGTCTTTCGTGTGTTAAAGAAGAGGGTAACAGG 

GAGGAGGTGGGGAGAGATGTTAACTGAGTCTAAATGAGCACCTGGAAGAA 

GAGATGGGACAGGCCACTTCTGCCTGGACTCCCTGATTGTTAAGAAGAAT 

GAAAAAGAGCAGAAGTCTTCCCTGAGCCCAACTTCACTCCCTGACTTAAC 

CTAGTCTTTGCCCCTTCCCTCTCACTCATGGCTACTTTCTGTGGTCACCT 

TGTTGTAGAAATGGATGTGCAGCCACCTCATCTTTTTCTACCTCCTTCAC 

ATGTTTTAGATAATTTAATGTAGTAGAAGACGGTTACAGCAAAAAATTAC 

AAAAATCAAAATATCTCTGCTATCTACTGTTGCATTTCTAACCATCCCAA 

AACAGTAGCTGAAAACAGCACTCGTGGTCGAGCGCGGTGACTCATGCCTT 

TAATTCCAGATACTCCGGAGGCTGAGGCAAGAGAATCACTTGAACCCGGA 

AGGTGGAGGTTGCAGTGACTCAAGATCATGCCACTGCACTCCAGCCTGGG 

TGACACAGTGAGACTCCGTCTCAAAAAAAAAAAAAAAAAAAGCACTCGTG 

TATTTTGTTCAAGATCTGTGGTTTGGGCAGGGCAGGGCTCAATGAGGACA 

TCTCGTCTCCGTTCCCGCAGTGTCAGGAAGTGTAACTGAGACTGGAGGGT 

CACACAGAAGATGGCTCCCTCAAGTGGCCAGCAAATTGGTGCTTACAATT 

GACAGGGAGCTGTTGACCAAGGGCCCCAATTCCTCTTCCTATGGCCCCTT 

CTCGGGCTGCATGGGCTTCTTTACAGAATGGCAGCTGGATTCCAAGAGCA 

AGTATCACAACCTACAGAAGAGTGGAGGAATATTGAAAGTTCACAGTCTC 

TTAAGACGTTGGCCCAGAAACTGGCAAAAGCTTCATTTCTGCCATGTTCT 

ATTGATCAGTCACAGAACCTGCACCAATTCAAGAGGAGAACATATAGAGG 

ACATCTCTCAATGGGATAAGTGTCAACAAATTTGCATCTATCACZAATCTG 

TCTTTTGGGTACAAACTATTTCTATTCCTCCATTATGCAAAATATACTCA 

CAACCTCCCAGGGGTCGCAAAAGCCTCATCCATTTATGGCAAATGTGGCC 

CTTTTAATTTATATAAAATAATTTGCGGGGGCTTCCTTTATATTTTTAAC 
TCCCCTGC 

>Contig3 5 

GTGCAGAGAAGTGATTTAAAGCCCTTCAGAAAGAATGCTTTATTCCCGTG 

GAATTTGG7AACTTGCTTGGGTGTGGGGAGGTTTGTCAGCTTTCTCCACT 

CAAATTATCAGACCCTTTCCATTTAGTGGTAGACCATTTCCCTCGTCCAG 

GCCAAGGGCACATAGTACAGAGAAATAGGGAGTTGTTACCCAGGGAGAGA 

ACTTGGCTCTAAACCTGTAATAGAAAGGTCAGTTCTGGTCTGGAGGGTCA 

ATTTTGATCTTTGGCTCAGATCCAGGAATTGGAACCAAGGCTTTTGAACA 

TTTTAATGCAGGGGATTAAAAAAATGATACGAGTCATTCACGAATATATT 

TGCTTAACATCTAAAGAGATCCCTCAAAACACTAGAAAAAATAAGAACAA 

AAATCTAATAAAACAAAATTTGTTAAACACATTTACCAAATTTTTTT T TT 

TGGTAAAAATTCAAATGTCATAAATAAAGCTAAAGTTCCTCTTGATGACT 

CGCTCCTCTGCCCTATTCCACTCCAAGTAACCACTATTATCAGTCTTGCC 

AATACCCTTCCAGACCTCTCTACCTCTATATACCATTAGAAGCACATGGT 

TTTGCATTGAGGATGTGCAGTGTTTTGTTTTACGTAAATGTTATCACTCT 

GTTCTTGTTCCATAATTTGCCTTTTTCTCTCAATGATTTGCTTGGCTATC 

TTTCTATTTCAGTAGCATCTCCTTTCTTTTTAACTTACCATTGTTTA 

AACCTTGCCTCTATCAACAGATATGTAGGTTGTTTCTAGTTGATTTCATT 

AAGTATTTATAAACAACGCATCAGTAGATGTCCATAAATTTCTTTACGGA 

AGATGGCAAGTAGTGGAATTGCTGAGCCAAAGAACATGTTTAAAAAACCC 

AAAAAAACTAGACGCTACCAATTTTCTCTCCAAAATGGCCATACCCACTT 

ACCCATACAGAGATGATTTGGAATCTGGCTTCCTCACAAGGTGAGATGCC 

TTCACAGTTTCATTCTTCCTGGCATGTCTTCCCTTTTGTATCTGAGAGAG 

CTGGCAGAATTGTGTCACTAAATCAAGGATAGAGGGTCAAATGACAGCTC 
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AAGCTCACAGGCACCTCI'GCTTTCTTCLwAGACCACCTGCTTTCCTGCCa 

CCAGCTCTGTTCCATCTTATAGAATGGTTGCCACTTGGGTGTCTGCTCCG 

ACAGCCATGTCATCCTTTGCACTGCAGTTATGAAGCAGACAGAGCTAGGA 

GAGGGGCTTTGCCAGCCTCTGCCCTAGCTTGGAGAACTTCAAAAAAGGAG 

GGTATTGAAGTTGAACTCCCCCAAAAAGGGGTGGTCCCCACACCTCAAAA 

AGTGGTGCCTCCGAAAGAAATGTAAAATTCGTGTGGGGGGGGAAAAAGGT 

TAJTTAGAAATTGTTGGCTTGTCGTGCCGAAAGTATGTGTGGTTACGGGG 

AGTACGGAAATTTCGAGGGGTGGGGGCGAGGCCGTGTGTCCTTTAGCCCG 

GGGTTTTCCCGTCGCATGTTTAAGGGGGGGGAAGAGGGGGGATGTTTTCT 

TTCCGCGAAGGTTTTTGAAGAACGGCGTGG 

>Contig36 

CCCCCCACCGCCACTACTCAACCGGCCGTTCACGAAACAACTCGCCACAT 

CCACTAACCCGCTGGCTCACCACCCACCGCCCTCCCGATCCCCCCAATCC 

AAACTCAACCCCCACCACCAAGCGCCTCCCCCCTCCCCCACCCTCCAGCT 

CAGCCCCAACCTACCACCAACCCCGACTCGCCCACCGAAAACCAACAGCA 

AACCCAAATGCCCACAAAACCAGTGTCCAAACCCTCCTTCCCATCAGTTT 

GGTGGGCCCATCACCGCTTCCCCTGGCCCAGGCTCTCCTTTTGTGCGCTT 

GGAGCAGCAGACTGATCTCCCAGCCTTCACTCACTTCATGTGGTAATCTG 

TTGTGTTCATCACTGTCAGAATCTTCTGCATCCCCTCACTACTCTGCTGA 

AAACACTCTAGTGGTTCCTCATTGCTCATTAATGAAAGTCTAGATATTAA 

ACGTAGAAGGCCCAGCACAATTTGCCCCTATGCCACCTACCTCTCTAATC 

TTTTCTCCTTACTCTGACAGACTCTCCGTCTGTCATTTATGTATTCTTTT 

ATTGCTCTCTTCTACTTTTAGTATGAACTGGATTTATGGATTTTTTTAAC 

ATTGCTTTCAAGTATGGAATAAAGAATTTTATTTATTTATTTATTTATTT 

ATTTGAGACTGGGTCTCACTCTGTTGCCCAGGCCAGAATGCAATGGTGCA 

GTCATATCTCACTGTAACCTCGAATTCCTAGGCTCAAGCCATCCTCCTGC 

CTCAGCCTCCTAAGTAGCTATGACTACGGGTGTGCATCACCACATCTGGC 

TAATGGAATAAAATATTACAATGCCTAATCTTAATTTTCAAAATTTTAAA 

TTACATTGTACCTAATGCCCATGCATTTACTTTTTTCAGTGGGTCAATAG 

CCCTCACTTTGGCAAAGGTCCCAGGCCCAAGGTAAGGCCTTACTTTTTCC 

AAACTCATCTTTTGAAAGACATAAGTGCCTGTAAGTTGTACCACATTAGG 

TTCTAGGAATTTTTCATCAAAGACTTTATCAGACTATTTTCCTCTAAGTT 

GAGAAAGAGCTGGGGGCAGAATATGGCACTGAATGACTGAAGAGAAGGCA 

CTGAAATCAGGCCAGAGGTTGCTGGAAAGAGCAATGAGGAACACCAGCAG 

CAATGAGGAGCCGGTGATGATTTTGGCTTCACAGGGAGGTGTGTACCACA 

CCGATTTTATCTCTACGTGGATGAACCACAGCTGTCGGCTCCCTTGTCTC 

CAGGACATCACACTCTCCACATTCCCTCCCATCTTCCGGCTTCTGCTTCC 

CGGGGCCCTCATCTGCCCCATCCTGGGTGAACACTGGTCGGTCAACTGCT 

GGGCGTACCTTCCCGCTCTGCACACCCTCCCTGGCCACCCCACCCACTCT 

CACGGCTCGCACTGCAGAGGAGCCGCATCTCTAGCTCCAGCCCATCTGCC 

TCTTCTGAGCTCTAACTTCATGTAGGCGACTCCTGCCGGTGTTGCCTCAC 

AGGCCCATCATACTTCAAAGCATTTTCCCCTCAGAACACCATGTCCTGGC 

TGCTCCCTCCAGAAGATACATCTCTCAAGCACATCCCCGCGGCTCTCACC 

TGGATGACTGC^TTCACCTTCTCCCACATTTGCCCCCTCCTTTGGATGTA 

TATAGATTGTTTTAAAATACAAATCTGATGTGCTTGCTCTCCTGCTTGAA 

ACACCTCAAAACTGCCTTCAGGATAAACCACTGCCCTTGACATGTTCACA 

GGTTGCCCATGGCCTGGCCCTGCCCATCTCTTCAGCCTCATCTCATGCCC 

CTTGCCCCTCGCTCTCTGGGCTTCTGCCTCCCTAGCCCTCCTTTAGGTTC 

TCTAACACACCATAGTCCTTCTAGTGTTGGGGCCTCTGCAAGTGCTGTTC 

CCATTGCCTGAGACATGAATCCCTCTCCCTATCTCTACCTGCACCTTCAT 

CTGATTAATCCCTACCCTTCCTACTCATGATGTTGCTTTCTCAGGGACTC 

TCTCTGACTTTTTAAACTAATCAGGGTCTCCCCAGTATATATCTTCATAG 

CACTCTGTATTACTCCTTTCTTAATGACCACCTGCTGTAGACAGAATGTT 

TGTCTTCCTCCAAAATCATATGTAAAACCTTCCACCAGAGCGATGATTAG 

AGAAGCCTCCC 

>Contig37 

GACTGACATTCAGAAGATATTAATAAGAGCACTAATGATGGGGATTGCAA 
CCATGTCTTTACTGACTTCCAGAAGCTTCTTACAGTAAACATGAAATCAC 
ATAATTTCTTCCACTTTCCTACTGTTTCTTGTTCTGGGCTCTGTCCTGCT 
TACTGTCTAATATCTTGGCCCCTTAAAAGTTGCTAATCTTCCAAACCTCA 
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TTCCTGTGACTGGGCC^wTCSGTCCTTG x - CATGGGCCTTGAAGATAC1 

CTGTACACTTATCTGGAGCATCCAGTGCCTACCACCTGACCCAGATTCCT 

CATTGCGCTCCTCCCTCCTCCACCTAATGGGATTTGCTCATACCCGTGTG 

GGACCCCTCCCATTTTCCCCAACTGAATACTTATCAAGACAACGCATTGC 

CATACTCCCTCGTACCCTGCTCTGGGCATCAGACTGAATGTTTGTTTCCA 

TTGAGGATCTGCAGCTGCATCAGTTTCCCCAGCACCGTCCAACCCCTTGA 

GCATGGCTAGTCCTAAAGCAGAGAATTAGCCTTTCTATCCCTGCTGCTAT 

ACATGCTGGGACAAATAATAAGAAATGACAGCATTTTATGATAATGCAGG 

CTGCAGGAGGCAGGAGGCAGGAATCAAATTCGTGCTTATCAAATAGTGCT 

CCAATTCTTTGAATATTGGACTATAGAATATGTCATGGATCTATGCTCAG 

GTGGGTTCCCTATTACTCACTCCACTGAGGCCAGGTTGTGGGATTAGCTG 

TCCAAGAGGGAGTTTCAGTCTCACAGCATAGGGTCATTCTGAGAATTACT 

GGCCCACACTTGTGTGGAGACCTCCAGAGAACAGAATCTGGGTTGGTGCC 

ATGTACTTCCAGGAGGAGAGAAGTGGCAGGATGCCCAGCCCCACAATCAG 

AGGGGAAGGGGCAGAGCCACATGTATGAAGATCCTCTCCCCAGTACGTGC 

CAATCACAGGGCTTCCTAGCTTTTGGGCCAAGGAAACAATGTGGGAAGCA 

AAAAAGGACAATTTTCTCCTCCCTTTGCATGAAGACTGAGCAGTTTTACC 

AGATTCCCAGGGAAACACCCTTCCACTCTGGGTTGAATGTGAGTGAGAGA 

CATTCAGCTGGAACACTAGAAAAACTATTTCCTGAGCCACTCACCTTTAG 

CCCTAGAAAGTGTTGGATTTGTCCTTCATCTTTGCCACAGTAGAGACTGC 

TGATAGCATCAGAACTTGGGCTCTGGAATTAGACAGATATGGGTACAAAT 

CTGAGCTCTCTCACTTATTAGTGTGGGATGTAGAGCAACTTTTAAAATCC 

TTCCAAACCTCAGACTTCTCATGCATGATGTGAGGATTGTAATAGGGCCC 

ACCTAATAGGGGTTTTTGAGAATTAAAAAAGTTATTCAATGAACAGCATT 

TAGCAAGATGCCTGACCATTGAGAAAATAACAAATTGTTTATTATTATTG 

TTATTATTAAACATCTTTCCTGCACCTTCTGACTGGGGGCATCGTATCAT 

CAGAAATAC7TAGGATGGGATGGATTCCTGCATGGGCTGAGTCAAGGGTG 

CAATAATGGAGGAGTGAAGAAGGAAGAAATGGAGGCAGAAATCCCCAGGA 

GCCCAGCATGGTACAAGGCTGAGCTAGTGCTGCAGAGCCTCCTTGGAACA 

GCCACAGAGCTTGCATCTGGCCCTGGAGGAACCTCTTCTAGCTGGCAGGA 

CCAGCCACAACAGTGGCCAGGGGATTTCCCAGGGCGTGGGCTCCTCAGGA 

GTTCATTTGGACCAAGCCTGCCTGGAGAGGGGTTATAACAGGGATCCTTC 

CCTACTGGCAGGTGATTTACCCCTCGGTGAGAAGCTCAGGCATTTGTTTG 

ATGGAAGGTGGAAGGCCCTGTGCTGGGCCAGTGACTATCAGGGATGGGCG 

GGTGGCTGGAAAATAGCAAATAAGACAATATGATAACACAGTTAACCACC 

ACACTATGTGAAGCTACAATATGGGTATCTGTAATAGACAATTCCAATGT 

AGAGAATAATTTTAAGGTGTCATTCTCCCCGCCAATGCCATAAGCACACG 

GCCTCTGCCTGGGTTTCTCACTGTGGAATGTCCTCCTGGTCTCCTCATGC 

CCAGAGAG7GGGAAGTACTCCTACTTTAACACCGGCTTTCCTGTCATTTC 

CNTGCAGCCCTCCTCAGCCCCCTCTGCACAGGGAGGTTTCCTCCCTGCTG 

CTGCAGTGCTTTGTACTTGTTAGTGGTACCTGCACACAGGTATTGGTGTC 

GTTGTCTCACCACCCTACATCACTGTAAGCTCCCCAGGAGCAGGCTTCCT 

GTTTGACTCACCTGTGATCCTCCACCTCCCACCCTGTAGTGCCTCAAGCA 

TTCTGTAGAGCACATGGACGCC 

>Contig38 

GACTAATAAGTACTTCATTATTTGGGTATTTTCCAAGAACAACATATTGT 
A GGAAA CCATTCTTTCTAAAAAAAAAAGTGTCCTTTTAAAAAGGTGAATA 
ATTTTTGTCTAATTCAAAGTTTATTGAAAAGTTATGTATAAAACAAGGTA 
AAAGGAACAAGGAAATAAGGGAAATGTAAAGAAAATTATAGAAATAAAGT 
GGTATTTTTTGGTAAGAAAGCTTAAAGAGAAATAATTTTAGGTAAGAAAG 
AATCTTACCTAAAATTTTGTGCTAGAATAAAGTGACTGGCTAAGAAAGGG 
ATGTTCAAAGCTATTTATGACAAACCCACAGCCAATATCATACTGAATGG 
GCAAAAGCTGGAAACATTCCCTTTGAGAACTGGCACAAGACAAGGATGTC 
CTCTCTCACCACTCCTATTCAACATAGTATCGGAAGTTCTGGCCAGGGCA 
ATCAAGCAAGAGAAAGAAATAAAGGGTATTCAAATAGGAAGAGAGGAAGT 
CAAATTTTCTCCGTTTGCAGATGCATGATTGCATATTTAGAAAACCCCAT 
CATTTCAGCCCCAAAACTCCTTAAGCTGATAAGCAACTTCAGCAAAGTCT 
CAGGATACAAAATCAATGTGCAAAAATCACAGGCATTCCTATACACCAAT 
AATAGACTAACAGAGAGCCAAATCATGAGTGAACTCCCATTCACAATTGC 
TACAAAGAGAATAAAATACCTGGGAATACAACTTACAATGGACATGAAAG 
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ACCTTTTCAGGGTGAAw . GCAAACCAC* ^CTCAAGGAAATAAGAiGAQ^rtA 
ACAAACAAATGGAAAAACATTCCATGCTTATGGATAGGAAGAATCAATAT 
CGTGAAAATGGCCATACTGCCCAAGTAATTTATAGATTCAATGCTATCCC 
CATCAAGCTACCATTGACTTTCTTCACAGAATTAGAAAAAACTAATAGCC 
AAGACAATCCTAAGCAAAAAGAACAAAGCTGGAGGCATTGTGCTACCTGA 
CTTCAAACTATACTACAAGGCTGCAGTAACCAAAACAGCATGGTACTGGT 
ACCAAAACAGATATATAGACCAAAAGAACAGAACAGAGGCCTCAGATATA 
ACACCACACATCTACAACCATCTGATCTTTGACAAACCTAACAAAAATAA 
GCAATGGGGAAAATAATTCCCTATTTAATAAATGATGTTGGGAAAACTGG 
TTAGCCATATGCTGAAAACTGAAACTGGACCCCTTCCTTACAACTTATAC 
AAAAATCAACTCAAGATGGATTAAAGATTTAAACATGGCTGGGCATGGTG 
GCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGATGGGTGGATCAT 
GAGGTCAGGAGATGGAGACCATCCTGACTAACACAGTGAAACCCTGTCTC 
TACTAAAAAATACAAAAAATTAGCTGGGCATGGTGGTGGGCGCCTGTAAT 
CCCAGCTACTTGGGAAGCTAAGGCAGGAGAATGGTGTGAACCCAGGAAGT 
GGAGGTTGCAGTGAGCCAAGATCACGCCACTGCACTCTAGCCTGGGCAAC 
AGAGTGAGACTCCATCTCAATAAATAAATAAATATGGAACTCTCCCAACA 
CAATAATAAGACAAACCCCCAAATGTTTTAAATGGGCAAAAATATTTGAA 
CAGACACTTCACAAAAGAGGATATGTAAATGGTCAAAAAGCACATGAAAA 
GATGTTCAACACCATTGGTCATCAGGGCAAAGAAAACTAGAACCACAATG 
AGATGCCTCTGTACACCACTTAAATGTCCAAATTAAAGAAAACAAGTTTT 
GGCAAAGTTGTGGAGCAACTGAAATGCTCGTGTATTGCTGGTAGAAAAAC 
AAAATGGCATAACCATCGCAGATAATTTGTTGTCAGTTTCTTACAAAGTT 
AAACATATACTTATTGATATGACAGTTCCATTCCAAGAGAAATGAAAACA 
TAAGTCCACACAAAGACTTGTACCTGGGTGTTCATGGTAGCTCTATTCAT 
AATTGCCAAAATCTGGAAACAAATCAAATGTCCATCAGCAATGGAATGGA 
TATACAAATTGTGGTACACATGTACAATAGAAAACTACTCTGCAATGGAG 
AGAAATTAAC CATTGACAAACACAAAAACATGG ACAAAC CT CAAAAACAT 

TATGCTGAGCAAAAGAAGCCAGACACAAAAGACTGCTCAGCGCATGATTC 

CATTCATATGAAATCACAGAAAGGGTCAGTTGAAGGTGCAGAGACAAAAA 

GTAGATCTGCAGTTGCCTGGGGATGGGGTGGGAGGTTGACTGCTCTGACG 

CGTAAGGAAATTTGGGGGTAGGTGGGGGATGGTGGGAATATTTTTTGAAT 

TGAATTGGGTAATAGTTTTAATAGGTAAAATATTGGACCCCACAGTATTT 

GAGATAGGTTTCAGTCAATTTAGACAGTTTATTTTGCCAAGGTTAAGGAT 

GCATCCGTGACCCAGCCTCAGGAGGTCCTGACAACCTGTGCTGAAGGCAG 

TCAACATACAGCTTGCTTTTATTCATCTTAGGGAGACATAATACATCAAT 

CAATGCATGTAAGGTTTACATTGGTTCAATCTGGAAAGGTGAGGGAACTT 

3AAGCAGGGAGCTTCCAGGTTACAAGGTAGATTATTCTCAACAGAAAGGA 

ATGTCTGGGTTATGATAAGCGGTTGTGGAGACCAAGGTTTTATCTTGTAG 

ATGAAGCCTCCGGGTAGCAAGCTTCAGAGGGAATAGATTGTCAAAGTTTC 

CTATCAGACATAAGGTCTGTGTTGATGTTAATGCTGGTCAGCTTTTCCTG 

AATTCCAAAAGGGAGAAGGGTATACTGGGGCATGTCCAACCTTCCCTTCC 

ATCATGACCTGAACTAGTTTTTTCAGGTTAACTTTGGAATGCTCTTGGCC 

AAGAAGAGGGGTCCATTCAGATGGTTGGGGGGGCTTAGAATTTTATTTTT 

GGTTTACAGTGAAGACTTTTCAAGCTAGAC3VCTTAAATGAGTATGTTGCA 

AAATGGCAATTTCTTAGCACGGC 

>Contig3 9 

GACGTCCTAAAGAAATGCTAAGGTAACTCAATTAACTATGCTAGAAAAGA 
GAGTTAAGTATTTAGGAGGATTTAATATGGTGTTAAAGTTGTGAAAATCA 
AAATGGAGACACTAATGTTAAGAAAACCCTGATAAATGGAGCCAGGGAAG 
GCCATGAAGAAAGAGTTCTCACACTTGTATCCCTGATCATGAAAAAGACT 
CTGCAAAAAACAAAACCTTGCACAAAGGCCATTGCAACCTTACACAAAAA 
ATACTACTTTAAAAGGACATGTGCCCAGCAACTGCCTGTCCAACCTCAGA 
CTGGCAATATCTTTGTTATTGATCTTAGTAGCCCAGCATAACTATTTCAA 
AACAGTGATGTAATGCTCATTTTTTTTCTTTTGAAAACTTTTGTCTTCCT 
GTAAAAACCTTTGTCTTCTTTACTTACCCTGAATATGCACAGAGTTTACT 
ATGGAGTGCATATTCCTGTTGCAATGCTCTATTCCCAAACAAACATCATT 
TTCTTTTAGAGAGCCTCTCTCTGTTTGTGATTTAGGTTGGTGATGTAAAG 
CAATGGCATAACTGAACACTGATTCAAAGAAAAGTGGCTTTTCTCTTTGT 
TGTATTAAAAAGAGGCCTTATAAATAGGATAGTAAGATTTGTAAGTTGAA 
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CTTAAAGCATGAAGAAAATTTAGGGGCCAGGCAGGGTGGCTCAidk'CCTuT 
AATCCCAGCACTTTGGGAGGCCAAGACAGGAGGATTGCTTGAGCCCAGGA 
GTTCAAGACCAGTCTGGTCAACACAGACCTCATCTTTACTAAAAATAAAA 
AAATTAGGCCAGGTGCAGTGGCTCATGCCTGTAATCCCAGCACTTTGGGA 
GGCCAAGGCGGGAGGATCACTTGAGGTCAGGAGTTCGTGACCAGCCTGGT 
CAACACGATGAAACCCCATCTCTACTAAAAATACAAAAAAATTAGCTGGG 
TGTGGTGGCGGGCACCTGCAATCCCAGCTACTCGGGAGGCTTCAGGCAGG 
GGAATCACTTGAACCTGGGAGGCGGACATTGCAGTGAGCTGAGATAGTCC 
CACTGCAC TCCAGC CTGGGCGACTCAGCAAGACTCTGCCTCAAAAAAAAA 

AAAAAAAATTAGTCAGGTGTGGTAGCACACAGCTGTGGTCCCAGCTACTC 
GGGAGGCTGAGGTGGGAGGATCATCTGAGCCCAGGAGGTCAAGGCTGCGG 
TAAGAGCTGAGATTGTACTACTGCATTCCAGCAGGGGCTACAAAGTGAGA 
C CCTGTCT CAAAAAAAGAAAAAGAAAAAGAAAATTATGTTTTTAAATTTA 
TAATTATAATAAATTTAATTACATAAATTTAAGCTCAAGTAATTGTAAAT 
ATTCTTTCTGTGCACATAAGTTATTCTTGTATTGACCCCACAGGAGCTGG 
CCATTCTTCAAGTCAGAAGGCCTGAGAGAGGAGCTGCCCAGGTGGTCTTC 
ATGGGGCTGTGCGGCCAGTCATCCCCCACAGGTTGACAATCCTTGTGTAC 
TTCATCCTCGTTGGATCCTCTGTATCCCTGACGATGAGCAACTGTGAGGC 
CCGTTTCAGCACTGAGTTCCAGTCAGGAAAACATCCACCCACCCACCACA 
CGCTCACACTTACACACACATTCACACATGCACACACGTTCTGGCTCCGA 
AAAAGAAAAAAAAAAAGCAATTTAAAATAATTCTGATCCTTTGCTTATTT 
CCACAAACTCCATGAAAATTGTACATTGTCCAAGCAACATTTCTTAATAT 
TCTCTTTTTCTCTCATATCCATTTTCCTTACTGCTGTCTCCACCTTTCTC 
TTCCAAACTCCCTGTTAAAATCCCTGCCCCAGCGAACTTTTAT7CAATTT 
TGTGGAATGGAGGCTGCTCTGATTTAAATTAAAAAAAAAAAAAAAATCCC 
TACTCCATGTCCCAGATCCCTAGTTGTTTTTTGTTTTTTGTTTTCCTGAG 
ACAGGGTCTTGTGTCTTCCATGCTGGAGTGCAGTGGCATGATCATGGCTC 
ACTGCAGCCTCAACCTCCTGGGCTCAAGTAAATCTCTTGCGTCAGCCCTC 
CCCAGTAGCTGGGAGTTCAGGTATGTGCTACCATGCCTAGCTAATTTTTT 
TCTTTTATTTTGTAGAGACACGGTCTTGCCAGGTTGCCCAGGCTGGTATA 
GAACCCCTGGGCTTAAGTGATCCTCCTGCCTCGGCTTCCCAAAGTGCTGG 
GATTACAAGTGTGAGGCACTGCAC CCAGGCTGGATC CCTGCATTTTTACA 
GATTTAGCATCACAAAAGTCTAAACAATTAGACTGACTAAGGCAGAACTG 
CCCTTATGACAGCAGACATAAGAAGGAAAAGGCCAAAACACTGTGTTAAA 
AATTATCCAAATGTGAGGAAAAGGCAAAGAGAGTAGGTGTGCC7TTTTAG 
TGTCTAAGCTGCCTGCCCAAGGGGCATCTGATGCTCTCAGGCAGGAGTCC 
ACAAATTrTTTTTTGTAAAAGATCAGATAGTAAATCTTTTCAGCGTGAAG 
AGCATGAGGTCTCTGTCACAAATACTCAACCACCATTACAACATGAAAGC 
AGCCAACAGACAACACATGACAAATGAGTGTGGCTGTGTTCCAGTAAATC 
TTGATTACAAAAACAGGCAAGAGGCCAGAGCTGACCCATGGGCCATAGTT 
TGCTGACCCCTTCTGTAAAGGAAAGTATTTTTGTTTGACTTGCTGTTTAC 
CATTGATTGAACACAAGGCTCTGTAGAGTTACTTGTTAACTTGCAGAAGA 
TTGATGAGTGGCAAGTAATTTTTATTCACCAGAATATANNATTATTCTGT 
TCAGTAGATAAGATAAACCCACTGTTATATTACTGTCTTGTTTAGAATGT 
GACTTTGATTCATTTTTTCACAAATTCATATTATTGCCCTAATTTGTATA 
TAAGTATGCTTCTTTTAAAAATATATATTTTTTAATAAATTTGAGACAGG 
GTCTCACTAGGTTGCCCAGCCTTTTGCTATAATGAGAGCATAAAGTGAAT 
TTCACACTTTAGCCTAGTGCATAGATGGGATTACAGGCACAAACCACTGC 

ATGCAGCTAACTTTGCTTCTCATTCCAGCACGTTCTATTCCNNNGNTTTT 

CATATACGCGTCTCTTAATGC 

>Concig40 

CGCATTCAGCCCAAGTTTTCTTCAGTGTTAAGGTTTTTGTTACTCTGTGC 
CCAAATGTCCTTCCAAAAAGGTTAAGTTTTTTTACCTTCCTGCCAACATT 
ATATGAAAGTGTCCACTTTTGTAGACTTTTACCAATGCTGACTACTTTTG 
GTTTCAAAAAAGCTCTCAGTAATTTTCTATTAATTACTTTTACCCTTTTT 
TATTGAGGGTGTTCAACTTTTTATTGTTAGCATATTCTCTCTGGGCTCCA 
TTGGACGCCTTGGCAGCTTTTTGGTAGTAGGTGCCTTTAGAAAAGTCCTT 
CTCGTC7GGCCCTTTCTGAGCAAATCTAGTGAACAGAATTGGCTCCATGC 
TCAGCATTGCTTAATACGGTTGATCCAGGGCCTAGGACTCATTCCTTCAT 
TACCATCCACTTGCATTGTCTTAAAGCAAGGCTCTATTAATTTAATTTGG 
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CATTTCCTGTCCCAGCTC1 , TTAGTTTCA.TTAAACAAAGGCTTTAG 

TCCCAGTAGATGCCTATGTTGCTTCCTTTTAAAAAATTTTGGAGCTGTTT 

CCCTAGCCTAACCTTTTCTTCAGGGCAGGAGTTAAGTCCCTTCTACTGCA 

TTCCTGTGAAGATGGTGATTCAAGAGGCAGGGCACCTGTTGCTTTGTGAA 

ACAGTCCACTCTG CAGC TGGGCAGCTCTGTTACTAGAATGTTCTCCCTTC 

TGGGGAGCCAATATTTTGATGTCCTCTGTGAATCTCATCTGCTTATCCCA 

TCTGTTTATGTCCTTGAAGATGCACAGGTCTGACACCACGAGGTAGCCCT 

TAGAAATTTGATGGCATTTCTGATGTGTCCCCAACTCTTCTCCAACCACT 

CCTCCCAGAGCTTGTTTCTTAAGCCCCTTGTGGAGCTGATTGCTTTCCTC 

AAGGCAGCTCAGTTTTTCCCAGTTTGCTCCTGGTGGTCCTGAAATATGAT 

TGACTCCTGAATACTCCAGGTGTGAAGGAGAGTGGGGGTGGCCTTTCTAC 

TTGTCATGGCCTGGGTTTTAAGTTGCTGTCCAGTGGAGCAGAGGTGACTT 

TCCCAGTGAACTACATTTTTTCCCCTCTAAATCCTTAGCAATTTTGTCTC 

CAGAGGCAAGACCTGGCCAAACCATTTGTGTTGAGGATTGAATCAAGAAT 

GATTGAGGAGATGACAGTAGTCCCCCCTCATCTGAGGAGGGCGTGTTCCA 

AGCCCCTCAGTGAATGCCTGAAACTGTGGATAGTACCCAACTETATATGT 

CTATGATTTTCCTATAAATTAATACATGCCTGTGACAATGTTTAATTTAT 

AAATTAGGCAAAGAGGCCAGGCGCAGTGGCTCAAGCCTGTAATCCCAGCA 

CTTTAGGAGGCTGAGGCCTCACCTGAGGTCAGGAGTTCGAGACCAGCCTG 

ACCAACATGGAGAAACCCCGCCTCTACTAAAAATACAAAATTAGCTGGGC 

ATGGTGGCAGGCGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAG 

AATCACTTGAACCCGGGAGGCGGGATTTGCGGTGAGCTGAGATCGTCTCA 

TTGCACTACAGCCTGGGCAACAAGAGTGAAACTCCGACTCAAAAAAAAAA 

AAAAAAATTAGGCAAAGAAAGAAATTAACAACAATAAGTAATGAAATAGA 

ACAATTCTAACAATATACTATAATAAAAGTTGTATGAATGTGGTCTCTTT 

CTCAAAATTACCTTTTTTTTTTGAGACAGGGTCTCACTTTATTGCCCAGG 

CTGGAGTGCAGTGGCACGATCACAGCTTACTGCTGCCTCGACCTCCTGGG 

ACCAAGTGATCCTCCCACTTTAGCCTCCTGAGTAGCTGGGACCACAGGCA 

TGCACCACTGTATCTGGATAATTTTGTTTATTTTTTTTTGCAGAGAGAGG 

AGGTCTCACTATGTTTCCCAGGCTGGTTTTGAATGCCTGGGCCCAAGGGA 

TCCTCCTGCCTTGGCCTCCCAAAGTATTGGGATTACAAGCGTGAGCCACC 

ATGCCTGCCCCAAAATTATCTTATTGTTCTATACCCACTCTTCTTCTTGT 

GATGATGTGAGGTGATCCATTGCCTCCTTGATGAGATGAAGTGAGGTGAC 

TGATG7GGGCATAGTGATGCAGTGTTTAGGCTGATATTGGCCTGATGATA 

TGTCAGAAGGAGGGTCATCTGCTTCGGTGATCCTGGATCATAGAGTCATG 

ATGATGTCAATGGTTGGATGTCAGGAGCAGACGATGTCAATGACTAACGA 

7AAGCTGGACAGGTGGGATGGTGGCACAAGATTTTATCACGCTACTCAGA 

ATGGAGCACAATTTAAAACTTCTGAATTGTTTATTTTTGGAATTTTTCAT 

TAATATTTTTGGATTGCAGTTGACTGTGGGTAACTGAAACTGTGGAATGT 

GAGACTGTGGAAAAGTGAGGGAGTACTGTATTATGGAACTGTAACTCTAT 

TCGGTAGGGGAACAGAATTCACATTTGtGGGGCCCAGGTCTCTGCATCTG 

TAGGGATCCAATTGTTTCATTTCTCGTTGTAGCAAAAACTTGGCTTTGGA 

ATCAGACAGATTGATGTTTGCTATCATTCTAAATGGGTGCAGCTACACTT 

TCCTCAAGAGGTAGTTCTGAAAATTTAACAAAATGTGAATTTCTTGGTAA 

AAAAAAAAAACCTCAAAAATATTCAGTTTCCTTTCCTTTGTGTCTGATGT 

ACTCCATCAAATACTGGGAAATATGTGTCTCTCATAGAAATGTCATGGAT 

CTTTGTAATTCTGATTATCCACAAACCTTGGGGATTAGCTGTTTCAATGT 

TCCTATTTTACAGATAAGAAAATGGAGCCTGTGGTAAGTTAAGTGAGTTA 

CTCATGGCTACTTAACTAATATTTTACTAGGTGATAGGCCAGAGCTAGAG 

CCCAGGTCACCTTCTTATCAATGCTCTGCCTTGTCTCTGTGCCTTCCTGT 

CTGTCTGTATGTGTATGTGCCTGTTGACAGTAAGGCATAGTTTAACCCAG 

TAGAACTACCGGTTTGTAATGAATTCCACTTGTAAATGACTGACCATTCA 

AGGAACAAGTGTTTTTTCTATGCTTGACACCTGTTTTGGATGCCAAAAAG 

GATACAAATGTAACTTCAGACACTCTGGGCCTCATTTTGCACTCATTAGC 

ATGTCCAAAATTAAAAAGACTGACCACACCAAATATTGGTGAGGATGTGG 

AAGAACGGGAACTTTCATACACTGCTGGTGGGGATGTAAAATGGTACAAT 

CCCTTTGGGTAACAGTTTGACAGTTTCTTAAAAAGTTAGACATATATATT 

TACCATATGACTCAGCCCTTCCACTTCTAGGTCTTTACCCAAGAGAAATG 

AAATGCTGTGCTTTTACAAATGTCTATACAGGAATGTACATAGCAACCTT 

ATTTGTCATTGCAAAAAACAGAGACAATTCAACGTTGTCAAGAGTGAATG 
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GATGAGCAAGCTGTGGTX 'GTCTATGCA*. j. GGTATCCTACTCAGCCS.Gi 
AAAGATATGGCTAAT ' 
>Concig41 

GACAACAATGTCATGCATAAGATGACGATGGCCTGGGTGATTGATGCAAA 
CAAGGATAAAGAAAATAATCAATTTTGTCCCCATTTTCAAAGACAGATAG 
CAGCAGCAAGAGTGTAAGTCTGAGGAAAGTCATATTCCTTCCTCCTACAA 
CATAGCACACACACTTACAAAAACAATACACAGACTCCTGGCCAATGGAC 
TTCAAAACTGAGGAGGATCATTAAATTTAAATGTTCACCGCTGCATGAAA 
TCTCCCTGGGTCCTGCCCTCCCTTCCCCACCCTCCTCCACTTGGGCCGGG 
GCACAGCAGTGATTCTCTCACCTCTCAGAGTGAGCCAGTGTTGGCTGCAT 
TGAAGGCTCCAGATATGCAAACAGGGCAGATATTCCTGGACCAGGGTGCA 
CAGAGTGAGGCTCCAACGCACCCTATTAACTGCATGAAGGATGAATGAGC 
CTCTGGTATGGGCTGGGACAGAAAAAAGGATTCAAGGGGCCCAAAAGGGT 
TTGGGTGGAACCTACCAGGAGCGGCAGTACAGACTCCTTGGGAAGGTGGC 
CATGATTTAGCCACATTCACCAATAGGATAATCTGGAGAATTTCCTAGCT 
TGAGTTTCTGGGAGAAAGCAGATTTCTGGATTATCTGGTGACAGGTAACA 
GGGCCGAGTTCATCCACAGCCACCTGCAGTGTTAGCACCTTAAGCTGAGT 
TCCTTGCACCAGGATGCTGTCACGCCCAGTCAGTGTGAGACGGTTCTTGG 




GGGTTAACAGAGAAAATGAAGGTACAAGTGCCTTGTGAACTCTGAAACTC 
CAAACCAGTCATTCTCAAAGTGCCGTCCACCAGTCTAGCACATCAGCATC 
ACTGGAAGCTTGTTTGAAATGTAAATTATCAGGTCCTCCAGAGCTATGTA 
TGAATTAGAAACTCTGGGAATGGGGCCCTGCAATCTATTTCAACAGGTCC 
TCCAGGTGATTCTGATGCAAGTTAAAGCCTGAGAAACTCTGTCCTATACA 
AATGGATGTCAACTCAAGCTGCTCTTCAGAATCACCTATAGCACTTGTTC 
ACCCGAATCCCTGAGAATGGAGCTTCAGGACTGCTATTTCTCAAAGTT T G 
CCTGGTGATCCTGAGATGGGGTTTGGGGGACAGAGATCCAAGGTGCTACC 
AGGTGTGAGGAATTGTTAGAAGGCAAACCTGGCTGTCATCTAGGGTGCTT 
AAAGGGTACAGATCCTAGGATTCTGCCTCTTACAGCTGAATCAGACTTTC 
CTAGAATGGGATTGCTGTCCAATGGCATGCCTCCTGGGTGACTCTGATGT 
ATAGCCTGGGCTGGGAACCACCAGAGGATTATCTTCCATTGACCAAGCTG 
ACAAACTCGCTTAAGGCTCTGAGTTTCACACTTGATTTTCTAGCCCCTGT 
CCTTCCATGGATCACCTGCCCCCTTCCCTCCTAATCAGGAGCACAGTCAG 




GAAAGATCCAAGGACAAGAGACTAGAAAAAAATTTAAAACAGCCAAGCAT 
TGGCC CAGT AATGGCATTTCAG AAAT C CAC CAAAAT ATTAAGATG CTT" 7 *^ 

TGAAAAATATCCAGAGCACTCATGTAAAAGTGCTTAATTATTAATAAAAG 

CTGACATGTGTTGGGTACTTCCTGTGGGTCTGGCACTAGGCTAATTATGT 

TTTTAGGAGTTGACTCAAATGCTCCCTGTCATAATTATGTGAAAAAATAT 

AATTATTAGCTCCATGGTAC^AATTAAGGAGAGGTTACATAAATAAAAAG 

GAATGATACTCAAATTAGTAACCAGAGCCCATGCTCTTAAACACTATGCT 

ATTATTTGTGGACTCTTACATAGGTGGCAAAAGTCAAAGGCTAGATTGAC 

TTCTGTCCACTTCCAGCCAAGATGAAGTACAAGATTCAGATACACCCTTC 

CGCATTAAACAACTTAGGAATCAGACAAAATATACAAAGCATTGTTTGTT 

ACACATTGGATAACAGACAGCACTAGATAGTCGTGTCTGAGAAAAGCGGT 

GAAATGAGCTGAGTCTTAGAATTGCCCCAGTTTACTAAGGGGCATAGTAA 

GGGCATAGCTGCAGCACAAAGAAGCAGAACCCAACAGAGACTGGCGT^ 

CCTGAGTTGAGAAAACCAAGTTGAAAATTTAGGAACACTAACACAGATAT 

GTAGGCAAGAGTATCAGAGAGGAGACAGTTGTAGGGAAAAAGAGAGCTTT 

ACAGAGAGACAGCGAGAGCTCCAGAGACCCGCAGAAGATTGCCCTGACGT 

CACTAGCTGAGTACCGATCAGTGCATACATGTAAGGATATTACTCAATAT 

GTGGAAAAGAACAGAAGGAATGATGTCCAAAGCTCACCCAAAGACAGGAA 

TCATTTATGTTTCCACCAGCCAGAGTGGAACAACCTTGTAACGCATATGG 

AGTACTCAAACGAATATTTCCTCAATAATAAGTTCAAATTAACTGAGACT 

AAAGCCTGCCCGCTTTGTCTGGACATGCCTAACAAAGCTTTGAGGGAAGC 

CTCAAAAGAATGAAACCGTGTCCAAGTAATTTAACTGTGTCCCAGAAAAA 

AATTCAAGAACATTTAAATAAATATTAAAATATGATCAAAC CCAGCAAGG 

TTAAATTCAAAATGTCTGGCATCCATTAAAAAATTACCAGCCTTGAAAAT 
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TGGCGGGAAAA7ATTA; -JATAATGAA.. . JAAAAAGCAAT CAAGAGA/ 

AGGCCTAGAAAGTATACATATGATAAAATTAGCAGACATTAAATGGTTAT 

GATTAATTTATTTTATATGTTAAAGAAGGTAGAGAAGAGCATAAGCACAT 

7AAAGAGAGACAGGAAAGTCCCAGTACTCACACAGGGCCAGGAGCAGTTT 

TCACCAGTCAGGTGGGAAAACTTCATATTTCATGGAGCATTGGTAGAGTA 

CACAG7GTCTTGCCTTAGTAGAGGGATAAATGCTGTTCTGTTCCCGCCTA 

ACCCATCTTGAAAGAAAATCTGAAAGGATCAAACTGTATTCAAGTAACCT 

AATCACATCCCAGCACACAGCTCGACTAGTTATAAAAACACAAAATATTA 

ATATCTAGAAACACAAAAATAATATCTAGCACCCAACAAGGTAAAATTCA 

CAATG7CTAGCATTCAATTGAAATTTTCTAGGCCATCAAAGAAGCAGTAA 

AATATGACCTATAAGGCCGGGCACATTGGCTCATGCCTGTAATCCCAGCA 

CTCTGGGAGGCCAAGGTGGGTGGCTCACCCGGAGGTCAGGAGTTCAAGAC 

CAGCCTGGTCAACATGGTGAGACCTCATCTCTACTAAAAATATAAAAATT 

AGCCCAGCATGGTGGTGGGCGCCTGTAATCCCAGCTACTCAGGAGGTTGA 

GGCAGGAGAATCGCTTGAACCTGGGAGAAGGAGACCGCAGTGAGCCAAGA 

TGGCACCAATGCACTGCAGCCTCATTAGAGAACATCGGGAAG 

>Contig42 

GAAACTAAAGGCTTATTTAAAGCGCGAGACCGTGGCGCCTTTGGACTGGA 

CCCTTTCTAATGATCATTTAGTATCAGGCTATGTGGGAGTTGACCGTTTT 

GCATAGCCTGAAAGCCAACAGTATCACTCCTCCTCTAGGTGTGGCAGAGA 

TGTGAGAGAAGGAGACTGACAGTCTGTGGGTGTGTATGCAGTGTTGGGGG 

AAGCGAGGCACAGGGGACAATACTGTGGTGTAGAAAACTAGTCTAAGGTA 

GCATCAGGAAATTCATGAAACCAAAATGAATTTCATAACAGCACAAGACA 

TTATTTGTiTTTGCCTCCCTCTCATTTTTTTTTTTTTTTGAAACAGAGTC 

TTGCTCTGTCATCCATGCTCGTGTGCAGTGGTGCAATCTCGGCTCACTGC 

AACCTCCACCTCCAGGGTTCAAGCAATTCTCATGCCTCAGCCTCCTGAGT 

AGCTGATTACAGGTCTGCACCACCCCGCCGGCTAGTTTTTGTATTTTTAG 

TAGAGATGGGGTTTTGTAATGTTGGCCAGGCTGCCCTGTCATTTTTTTTT 

TACTAGTGTCCAGTGGAGTTTTTTAGGGGCTACATAACATGATACTGTCA 

TTAATCTAATGGCTAATGAAAGGGATATGTATATGTTTTTGTGTTTAAAA 

CAAACTTCTTTGGGGTCCTCAATAATTTTTAAGAGTATAAAGGGGTCCTG 

AGATCAAAGAGTTTGAGTTCTGCTGGACTGGGACAGTGGTTGTCAACCCA 

GATTGTACATTAGGGTCATCTGGGAAGCTTTAAAATAGTACTGATGCCCA 

ACCTTACCGCAAACCAATTAAGCCAGAATCTCTGTGGATGAGAAGTCTTC 

ATTGTCATCATCACCATGACCATCATCATTGTCACCGTCACTACACCATT 

ATCATCATCATCATATCATCTTCATTATCATTGTTAGTATCTCCATCACC 

AT C ATCAGCAT CAC CATTATTATCATCATCAT CATC C C CAC CATCAT C CT 

CATCGGAACTTCACCTGCATGGAGGACAATCCACTATGCATTAGGTGCTA 

TGCTATTTGCTATACTCCTTATTCTCACAACTGCCCAGAGAGGCTGATAT 

TATCTCACTTTATAACAGGAGGAATCTGGATCGGAAAAGTTAAGGTAAGC 

TAATTCACAGAGCGAGAAGAGATAGAGCCAGGATTCGAAACCAGTTCTCT 

GCTACATCAATGTTCCCAGTCCTTGCACTATTGAGAACCTCTTTAGTTAT 

GCTTTCACCCCTCCAACACCACAGTAAATTTTTTCTTTTTTTAAAAAAAT 

TATACTTTAAGTTATAGGGTATATGTGCATAATGTGCAGGTTTGTTACAT 

ATGTATACATGTGCCATGTTGGTGTGCTGCACTCATTAACTCGTCATTTA 

CATTAGGTATATCTTCTAATGCTATCCCTCCCCGCTCTCCCCACCCCATG 

ACAGGCCCTGGTGTGTGATGTTCCCCACCCTGTGTCCAAGTGTTCTCATT 

GTTCAGTTCCCACCTATGAGTGAGAACATGTGGTGTTTGGTTTTCTGTCC 

TTGTGATAGTTTGCTCAGAATGATGGTTTCCAGCTTCATCCACGTCCCTA 

CAAAGGATATGAACTCATCCTTTTTTATGGCTGCATAGTATTCCATGGTG 

TATGTGTGCCACATTTTCTTAATCCAGTCTATCATTGCTGGACATTTGGG 

TTGGTTCCAAGTCTTTGCTATTGTGAATAGTGCCACAGTGAACATTCATG 

TGCATGTGTCTTTATAGCAGCATGATTTATAATCCTTTGGGTATATACCC 

AGTAATGGGATGGCTGGGTCAAATGGTATTTCTAGTTCTAGATCCTTGAG 

GAATTGCCACACTGTCTACCACAATGGTTGAATTAGTTTATAGCCCCACC 

AACAGTGTAAAAGCATTCCTATTTCTCCACATCCTCTCCAGCACCTGTTG 

TTTCGTGACTTTTTAGTGATTGCCATTCTAACTGGCACCACAGTAAATTT 

TTATAGATTTTATAAGCAAATTGTATTTACTGTGCAAGAATTGGTTTATT 

TTTTAAACCATGTGTTGCAAACATACAATGGTTAATTGTGATATTTGCTC 

AGTACAAGATCATCAGATCACTACACAGACTTGAGGTAATTCCACCTAAA 
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AGCAAAGAGAACTGACCCCACATT^ ^ 

CCCTATAAACGAGCCAATATGAAGAQAAGGCCTTAATGTGGTTAACTATG 
TAATTTTTTTCTGACTTTTTGAAATACTGAGAAGAGCTCATGACTCTCCC 
ATCTCCTAATTCTACCTTGGTGGATTTTAGACTGACCACAACTCATGGGT 
AAATGAGGGAAGACGAATAAGAAACCTTGCTTTTTTTTCCTCCTTGTTTT 
TGGCTGGCTGCAGTGGCTCACACCTGTAATCTCATCACTTTGGGAGGCCA 
AGGTGGGAAGATCACTTGAGCTCAGGATTTCAAAACTGGCCTGGGCAACA 
TAGTGAGACCCCATCTCTAAAAAAAAAAAAAAAAAAAAAAAAGGCGACGG 
GCGGTGCGTGCCTGTAATCCTACCTACTCAAAAAGCCGAGGTGGAAAGAT 
CACTTGAGCATGGGAGGTCAAAGCTGCAGTGAACCTTGATTGCACCACTT 
CATTCCAGCCTGGGTGACAAAGCAGGACGCTGCCTCAAAAAAACAAAAAC 
AAAACCTTAATTTTTTGGCTATTCTTTTCTGGTAAGAATGGTATAGAGAT 
GGGGATGAGGATGGCTATTGTATGAGAGAGCAAACAGGGTCCAAGCAGTG 
CTCTGGGCTGTCTAAGGACCAGTAGTCAGCTTAACTTCTCAAATTTCCAG 
GGAAGGAGTTCGGAGTGGTAGAATATCCTGGGTATGCCCAAAGCATCACC 
TTGCAAATAGCCTGTCATGAATAATTTGTTTCATTTGTTATGACTGGAAA 
CTGGCTTTGTGTATGCCAGAGAATGGGGGCAGGAAAGAGAGATTGGTGTC 
TTGAGCTCTCTGTGCCTCTGGGGCAGTGATGCTTTTCCTCTCATGTGGAA 
GGAGAGCATGACTGAAAAGGTGCACAAATAAGGTGTCTGTGAGAGAAATT 
AACCTTCCAGATACAGAGACACAACCTTCCCCAAGAGGTCCTCATTGCTC 
TGCCTTTTTTCCTTTTTTTTGCTTGTTCTACCATTAATAACAGAAACTGA 
TTATGACCTCAAAAGAGAGGAGAAAGCGACTCTCCCCACCCTAGAGCTAG 
TTAACCACCATATCTTCCTAGATCTCAGTTCAAGAGTCACTTCCATCCCC 
AATAAAAGCCCTTGAGTGCTGAGCACCTCTCCGTCATAGCATTTGTCCTA 
GGGGTTTTTGTACATTTTCTTGTGTGAAACTTGGGTTGACATCTGTATTT 
CCGACTAGATTACAGTTTCCTCAAGGGTAGGGATGTCTTGCTTGCCATTT 
TCAGTTCCAGCATCTAGACAGTACCTCAAGCAAACAAGGCCGAGGGGGGT 
GCGGATCACGAGGTCAGGAGTTCGAGACCAGCCTGATGAACATGGTGAAA 
CCCCGTCTCTACTAAAAATATAAAAATTAGCCAGGCGTGGTGGCAGGTGC 
CTGTAATTCCAGCTACTCAGGAGTCTGAGGTAGGAGAATCGCTTGAACCC 
GGGAGGTGGAGGTTGCAGTGACCTGAGATCCACTGCACTCCAGCTTGGGT 
GACAGAGCAAGACTTCGTCTCAAAAAAAAAAAAAAAAAAAGAAAGAGAAA 
AGAACATCAAATGAATGAATGAGTGAGATGAATGAGTTAGCAGTGTTGGA 
TTTAAGTGTCAGATTCTTCCCAGCTTGACTTTTTTCTTTGGCTTAGTGAT 
TTTGAGGTCNCAAGATTTATTTTCCTTTCACAAAGGTGATCACTACCATA 

AGATCTTCAGAAAAAGAATGTGGCAAGCCANGTCTCACTAATGCAAATCT 

CTATAACAACTGTATCAGTACT 

>Contig43 

GAGGTGTCATAAATATGGACCGATAGATGAATACAGGTAGGATGGGACAC 
AATCTAAGATCCCAGGGGGGGGAGACCACACGCTTGGTTAGGGAGACCCA 
AAGTGGACCGTGTGGCCAGAAGAGTCCCGCACTGCACTCTAGTGACAGTG 
CAGAAAGTCACTGTGGGAAATCTAGAAGTTTCTACAGGTTGCTATTTCAT 
CATAGCACTGTGCAGGCCAACCCTTCCTGCTCCACTGGCTGTTGGGAAAA 
GCTTTCTCTTTTCTTCCTAGCCAGGGAGCTCTCAAAGTGTTCCACTCTCT 
CACCTCCACCCAGGCGTCCAGGTGTGGAGGACACTTGCCGGCTGCTTGTC 
TGCTGACTCATCCCTTGGTTTCACTTGGAAAACCTACCACCAGCTGGCCT 
CTTTCCAAGCATCAGCCTCCTCATTTTCTTAATCCCTTAGGTGTGATCTC 
ACCTCCACACAGTAGATTGCCTCAAGGCCCAATTCCAATATGAATAAAAA 
TGATTATTTTGTCATCTTCCAATCTTCCTTTTAAAATATTATTTTATAAT 
TCCCTTTAGGAGGATCACCTAAGTGAAGACTATTTTTACCTAAGAAATGT 
TAAAATGTAAAGACATGGTTGTAATCTGGGGATTCCTGTTAAAATGGCTA 
GCAGACAGAAGTCAGACGACAGGCTAGAAATGTGTGAAGAGTGGTTGCCT 
TTGAAAGGCGGAGTTGGTAATGATTTTCTTCCATTTTTCCATGCTTTCCA 
ATTCTCTACAAAGGCCTTAATATTACTTCGATAACCAGGACCTCTGATAA 
CCTGCCCCCACCGAGTAAAGACTTAGCTGGGAAAGTCAGCTTCATGTGAG 
GTAAAAGGAACCAGGTAATACACAATTCCCACTGCCAACTGTCGGGTGTG 
CAGGCCTGAGCTTCCTGCATGTGGGAGGAAAGAGAAAGAAGAGAGAAACT 
C CAAGATC CAAGAGATCCAGCAAGAAGG CTGGAGT CTGAGGAC G CAGAAA 

GCTGAATGGCACAGTTACCACTATTGTGCTGAGGTTCTGTGGCCTCTGGG 
TCTCTTGACAACTGGGCAAAGACCCACAGAAAACTATCTCTAGACCCTAC 
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CTGTGGGAGGGGAAAG1 ^ JTTCAGATCAx CTACAGGACAGCCAC'CTrGGk^ 

CTC^AATGGCTTACAGTTCCTTCATCCAGAGGGTCTTCATCTAGTACATA 

CCAGGTGCTAAGCCTGGGTGCTGGAGACATGACGGGGAACCCATTTACCA 

TGGCTTTGTTACTGTGACATTCACATCTAGGGAAAGCCAGCAAAGGGGAG 

GGATCGAGGAGAGCTTGTTAGGCAGAGAAAATACCCAAGGGCAAGGGAGA 

AGCCAGCCTGTTCTGAGCACACACAGTGGTTCCATCTAACTGGGCCTCAG 

TGiTCAGGTTGGACTGGAGATGGGGCTGAGGAGCTGTCACAGAGCATTCTG 

GACACAGATGTCACATAGTCCCTTGAGGTTAGGGTCCTTAGGCATGGCAG 

CAT7GCTTTGAGTTTTTCCTTTTGTAATGTTGCCATTCATGACAATGTGG 

AAGATGGGTCCTTGCAGAGAAGGGCAGGGCTGTGAGACCAGTTAGGAGAC 

TAAGATGTGAGCCAAGGAAAATGAGGAACACCTGAACACTGGGGCAGGTG 

CAGGGCCCAGAGAGAAGCAGATGGCTTCCTGAGGTTTTAAGTAGGTAGAA 

TCAAGGCAGCTGGTAAAGATCTTTTATTACATATAAACTGGAATAAGCCA 

TCTGCTCCAAGACAAAAGAGTAGGCGGAAAACAATACAAGACAGAAATGG 

AATTAGAACAAACCTGGGAGGAATGTGGAATTAGAGTAGAGAGTCCAACA 

CTGGCTGCAATCATAAAAATGTAAAACAAACAAAAATTTGCTAGGTGTGC 

TTACTTAGAAATAATTAGCTGTCATATTAAGTTCACTTGTGTTATGGCTT 

AAATGTGTCCCCCAAAATGTGATGTGTTGGAAACTTGATCCCCAATGCAA 

CAGAGTTGAGAGATGGGACCTTTAAAAGGTGATTAGGTCATAAGGGTTCT 

GCCCTCATAAATGAATTAATACTGTTATCATGAGAGTAGATTCCTGATAA 

AAGGATGATCTCTGCCTCCTCCCCACAGCCCTCTTGTGCATGCTTTCCTG 

CCTTTCCACCTTCTGCTATGGGATGACACAGCAAGAAGGCCCTCACCAGA 

TGCAGCTCCTTGATCTTGGACTTTCCAGCCTCCAGAACTGTAAGCCAAAC 

AAATTTCTGTTTATTATAAATTACCCAGTCTCAGGTATTCTGTTCTAGAA 

GCACAAAATGGACTAAGATCATTAGATTATCATTTTTTATCAGACTGTTG 

AAGTGAAAAATAAAAATCAAATAAAGAAATTAAGAGAGCTGCATGCAGCA 

GCTCATGCCTATAATCCCAGCACTTTGGGAGGCCAAGGCAGGTGGATTGC 

CTGAGCTCAGGAGTTTCAGACCAGCCTGGGCAACACGGTGAAACCCTGTT 

TCTACTAAAATACAAAAAACTAGGCCGGGCGCGGTGGCTCACGTCTGTAA 

TCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCATGAGGTCAGGAGATC 

GAGACCATCCTGGCTAACAAGGTGAAACCCCGTCTCTACTAAAAATACAA 

AAAAAATTAGCCGGGCGCGGTGGCGGGCGCCTGTAGTCCCAGCTACTCGG 

GAGGCTGAGGCAGGAGAATGGCGTGAAACCCGGGAAGCGGAGCTTGCAGT 

GAGCCGAGATTGCGCGACTGCAGTCCGCAGTCCCGCCTGGGCGACAGAGC 

GAGACTCCGTCTCAAAAAAAAAAAAAAAAACTAGCCAGGCATGGTGGTGT 

GTGCCTATAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATTGCTTGA 

ACCCAGGAGGTGGAGGTTGCAGTGAGCTGAGATCATACCACTGCACTCCA 

ATCCAGCCTGGGTGACAAAGCAAGACTACATTTCAAAAAAAAAAAGAAAG 

AAAAAGAAAAAAAGAAAAGAAAAAGAAATTAAGAGAAGGGCAGGTATTAA 

CCCCAAATATCCCACCATAGGGACACATTAAAGTTTGCTTGGCCACTCCC 

CTAGCATAATATATGGAATGTCTTCAAGGACCCTCTGTTGTAAATACAAG 

GCCCTGCTGGACTTAATACAACCTGCAGGCTTTGAGATCCCTACTCTGTT 

GCCATCTCTCATAGGATTTGCAGACCAAATCCAAATACTTAAAATAGCAA 

CACTCACAAACATGCAAATCAGAGCAGAAAAGAAACTTCTAAAAGGCCCT 

GAAACTACACTTTATGAGAGAAGACAATAGGGACCTGAGGGTGGTAGAAT 

TTTCTCTCTATGCATCTATGTTTCCAGGGCTCACTTTCTCAATAAACTCT 

TAAATTGCTTTTAAAGTAAGGGAACAAGCAAACATTACATTTAAGAGAAA 

TCAATTTCATAAAGAAGGGGGGATGTCCAGGGTACTTTGCTTCCATGTTT 

TGCTTCCATGAATTTGTGTTTAACAGAAGATGCAGAAAAACACACAATTA 

TTGCAAAATCAAGGAAATCCACTCTAAACATCCCTTGGTTTCCCAGGCCA 

GTGTCACAACTGAAAACACATATTGTGGCTAATTATGTGTCACAAATTAG 

AATGACAAGGCAAGAAAAAAAAAACTCTCTGATTAACTAATAGCAGCCAA 

CACAGACAGCCTGTGTAGCTCGACTCTGCTGGTTTATAAAAGGCAGAAGA 

AGCAAACGGCTTCTGTGACCGCAACAGGAAGGGCCTCTGCTCTTAATAAA 

TAAATAACATTTAAATTATTCTCCCCCATTTGCAAAGCATTTTCCAACTC 

ATTATCTCATCTGACCAGGTATTATTGTATCTGACCAAGAACTTGTATAC 

NAAATAAAGAATAAAAAATAAATATGGGCCANGCACAGTGGCTCATGCTT 

GTAATCCCANCACTTTGGGAGGCCCAGGCGGGTGGATCACTTGAGGTCAG 

TAGTTTGAGACCAGTCTGGCCGACATGGCGAAACCCCGTCTCTACTAAAA 

ATACAAAAATTAGCCCGGCATGGTGGCACATGCCTGTAATCCCAACTACT 
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TGGGAGGCTGAGGCACG^iSAkTTGCT^ 

GTGAGCCGAGACTGCGGCCATTGCCCTCCAGCCTGGGCGATGAGAGCGAA 
ACTTCATCGAAAAAACAAAAACAAAACAAAACAAAAAACACCTTAGAAGA 
AGCGTTCCTCCTCTTGCTTTCTGAAGACACTCTACGCTGAAACAGTAACT 
TTCAATAAACCATCTCTTCTCACCGCACTCTGCGACTTGCCTTGAATTCC 
TTTGTGTGCAAGATCCAATAAGCCTCTCTTGCGGTCTGGATGAGAACCCT 
TT^TTTGGAATACTCTGACACAACAAATTGCAGAAAGAAAGTCTCACATG 
TATAAAATAAGCAAAAAGATTCTCTGGCATCTGAAGAAACAATTTCCTTG 
TCAATATTAGTATCACTATAAGTGTAGAACAACCTGTTGTATGATGCTAC 
ATAAAGTATATGAATCTGAATACTGTTGGATACAAAGGGAGACTATNNAA 
TGTAATACGTCGCCCGAAATGACTACACTGTTGGTGATCTTTCTTTCAAG 
AAGCANAATATTGCCTCNAACATCCTGTACATGGTATAAAATTTTA 

>Concig44 

CCCAGCAAGAACACCAATACAACGGGGGGGGCGTTCTTTGTGAGGGGTGG 
GGAGGTCAATTTTTTGGAACCTGCAGCAGGTAACACACAAAACTTCCACA 
GCTGCTACCAGCTTTCCAGGAGAGCCTGTGTACCTGGAGAGGAGAAGGCA 
AGTGCTTCCGAACTTGACTTGATGTCTTAGATTCTGCAATGCGTAGTCTG 
TAGGGACAGGCTGTAGCTTATCCTATAGGCTTGGGCTGGAGTCAGCAAGC 
ATCTGGGCTGGCAGAAGATAAAAGATGCAAAGGTGGAGGAAAGCATACGT 
GGTCTGGAAGACAGACTTGGTGGGTGGGTGGCTGCTACAACACCCTAGTT 
AGAGGTAGAGGGGTAAGTCAGTGTGTCTTCTGCACAGGCCTCTTCCCCAC 
CTCATTCTTCATTTCCCATACAGCCTTGCTGAGTTATTCACAAACATCTG 
ATTCAAC7GGAAGCTGGGTTGAGGATGACCTAAAGGACTAGTGTGATGCC 
TGCCCAGGGGTGTGGGCCCATAGTCAGAGTCCAGAGCCTCCTCTCAGCTT 
TTAGCACATCT CAC CCACATCCTGGGTCCTTAATTAGCAATATGAAAGCA 
AGCCAAGTGACAAGATTTTGTCCCTGGGAAGTCCAGAAGCACTCCTTTTC 
TCATTTGTATAAGCATAATGATTTGCTTACATAAATAATCATGAAAATTC 
AAATCCCTCTCAGAAATCAGGTCATAAAACCATGAAATGCAGCATGTGGG 
CAAGAATCACAGGGAAAGGTAGGTCTTGGAAAAGAAAGGATGGCAGGGAG 
GAAGAAAGCAGGGTGCCAGGGGCCCTGGGCTGCTGTCGAAGTCAGGTGGC 
TCACCGTCTCTGAGAACATTTCACTTTCTGGTAAATGGGGCAGTTGGAGA 
TAGAAGGGTTGGGTGAATGCCAAGAGTGAGCACAGCTGAGGTCAGTGCTG 
TGCCTGCAGTCCAGGCGGGAGTAGAAATCCTGGGCCCATCTTACCTCCGA 
CCTCATTTCCTCCTCTGTAATAATGTGGGGGTGGGGGAAAGTTCTGGTCA 
TCAGCCCTAGCATTCCATGGTTCATTTCCTCATCAGTGATGGAAAATCAC 
CAAGCAAGAGAACAGGATGGAGAATAACCGGATGGGTGCAATCGGAGGTG 
CTATTTCAGGTGAGGTGGCCAGGGAAGGCCCTCTGAAAGGGTGGCTTGAG 
CAGGTGG CTGAATGTACAGAAGCTGCCAATCATGAAAGATCTGGGGTACA 
GCATGCCAAGCAGAGGAAATGCGAGTGCAAAGGCCCCGAGATTGGATGTG 
GGCTTAGCACAAATGTGGCATGGCAAGAAGGCCAGTGTGGCTGAAGCAGC 
ATGAACAATGGGTGGAGGGGCTGAGAGGACAGAGGAGCAGGAAAGAGCCA 
GGCTTGGGTAGGAGAGGTGTCAACTTGATATATGATGCAAAGCCCTTGGA 
GGTT C C CAAACACAAAAGCAATGATCTAATATATGGTTTTAAAAATGCCA 
CTCTTGGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAG 
GCCGAGGCGGGTGGATCATGAGGTCAGGAGATCGAGACCATCCTGGCTAA 
CAAGGTGAAACCCCGTCTCTACTAAAAATACAAAAAATTAGCCGGGCGCG 
GTGGCGGGCGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAAT 
GGCGTGAACCCGGGAGGCGGAGCTTGCAGTGAGCCGAGATTGCGCCACTG 
CAGTCCGCAGTCCGGCCTGGGCGACAGAGCGAGACTCCGTCTCAAAAAAA 
AAAAAAAAAAAAAAATGCCACTCTTGCTGTGAAAAATTGACCCTGGGGGA 
AGGAGGAGTAGAAATGTCAAAAGTGGAAGCAGACCACTCAGGAGGTCAGG 
GCAATGGACTGTGCAGGAGAGACTGACATCTTAGACTCGGGCAATAGGAG 
AGAAGGTGGTGAGGATTATATTCTGGGCATAAAGGCAACAGAACTAGCTG 
ATGGCGTCAACGTAGGAGATGAGGGAAAGAAAGAAATCAAAGGGCATTCA 
TAGGTTTGAGGGTTGAGTAACTGGGGATATTTAACAGAAATGGAGAAGTC 
TGGGGAAGGGGCAAGTATTGTGGGGGCAGGGGTCAAAAGTTCTGTATTTT 
GGCCAAGTTAATTAATATTTGAGATACCTCTTAGGTGTCCAAGTGAAGAT 
GTCAAACAGTCAATTGAATACAAAATCTGAATCTTAGCCCAGGATGGTCT 
CACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGAGAGGATCACTTG 
AGGCCAGGAGTTTGTGATCAGCCTGGGCAATAGAGCAAGACCCTGTCTCC 
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ACACACACACACACACA . AAAAAGTCA x ^CAGGCATGGTGGCAGATGCw x 

GTAGTCCCAGCTACTCAGGAAGCTGAGGCAGGAGGATCACTTGAGCCCAT 

GGTTC^GGCTGCAGTGAGCTATAATCAC^TCACTC^TACTACACTCCA 

GCCTGGATGACAGAGAGAGACCTCATTTATTAAAATAAAATTTAAAAAAA 

TTAATTAAAAATAAATCCAAATCTTTCCTGAGATTCATATTCAGGAGTAA 

CTGTCATGTAGAAGGCATATAATGCCATGGGTCACATGATACCATCTAAT 

GAATGCCACTGGAAAAGAGAGAATAGCTAAAAACTGAGCACTGGGCACAC 

CAGCACAGTGAGGTTGGAAGGAAGAAATGGAGCTAACAAAGGAGACAAAA 

GAGGAGTAGCCAGTGAGAAGAGAGAAACATCTGGAGAGAAGAGAGAGCAG 

CAAAAGGTGGGTGAAGGAGAATGTGGTCCACCAGGCCCAACAATGCTGAG 

CAGTTGAGTAAGTGAGGACCTGGCCACTGAATTTGGCAAGAAAGAGGATG 

TCAGCGGCCCTAGAACAAAAGTGAAGAAGAGCTTGAGGACGGAAGCCTGA 

CAGGAGTGAACTGAGGAGAGAATGAAAGGTGGAGACATGGAGCCAAGGAG 

CACTGAGACTCCCTTGAGTAGTTTTGCTGTAAAATAAAAGTGAGTGCAGA 

GACGGGGCAGGGGGACAGAGAAATGCAGGGGTAGCTGGAGGGAGCCACAG 

AATCAAAAGAGGGTTTTTGTGTTTAAGATGGTAGTTGTCACATAGCACAT 

TAGTAAGTTCATGTGAATCACAACGTAGGTGAGACAGATCACTAATGCAG 

GAGTCAAATCCTTGCAGAGCCCCCAGAGGAGGTGATGAAGGGAAGTGATG 

GACATCATTCAGATGCAAGTAGGTTAGCAATTCCTGGGGTACAAATAGGA 

GGTGACTCCTTTCTGATTGCTCCTGTTTTCTGAATGAGATAGCACATAAA 

GTCCACTCAGCCATGTTAGCTGTTGAAGTCCTTGTGGCTGTCATGCCTGT 

ACAGACTGGGCTCTCCTCTCCAGCATTTCCTCTCAGACTAAGCTGAGCTG 

CACTAGCCGCTGCCACATCCTCTTGGGGCCATCCTCTGCCACACTCCACA 

TATTGCTGTGGTTTGCTTGCAACCCCTGGAAGGTCCTACTGGCTGCTCCT 

AGAAGAGTCTGGGCGGCATCTCTCCCTTACTCGTTATCACATGGTGCTGT 

AAGCAGTGGCCACACACTTTAGCTGGTGGGATGGGCCATCACAGGCAGTA 

AATGCGAAAGACTGCTCAGATTTTAAAGCACCCATGAATCAGTAGAATGA 

GTTTAGAATTGTAGTCATCAACACACATTAAAAAAAAAAAAAACAGGCAC 

TAAAAAAATTAGTTGAGTAGGATAAAGCCATAAAAGATATTAACTACAAC 

CCAGATAGGAGGTGCAAAATTGTC CTTACATAAATCAGATGGAAAAAGTT 

GAAAGCAGATAAGATAAAATAGGTAAGCATGACATTTAAAAGGTATTCAT 

GGGACGTGGTTACAAAACCAACTCACAACTAAAAAGTCTTAGGACCTCTC 

GCTGACTTAGGAGCCTGATCCCAACTTTGAGAATGACTCAGTGTGTTACC 

CTGTGGCTAGTGTAGACCAATGATCCTGTCTCAGAGTCACTAGCCAACAG 

CCCATATCAAGTAATTGAAACTTTGACTCAGAAAC CTCAGTGTCAGAACC 

TTTGACTTAGGAACCACCTGTAGTGGTTAACTGCAATTTGCACCCCTTAG 

TTCAGGGCTTTACAACACCGGGGGCGGGGAGGGGGAAGGCATAGAGCTGA 

TGACCTAAAGGAAACCCATTGCAGCAACGCTTTTGTGTTAAGTTTACAAA 

TAAG7GTTGTTTTAGAATCCTCCAGGTAATGCCTTTGTTATTTAATGTG7 

CTGAGACAATTCTGCACATTAAAGAATATAAAATATTACCTTGTAATTCC 

AATTTGAAATGTGTAATTGACATTAGACTTCTATTTTAATTTGAAATGTC 

TAAAACAATGTGGTTAAGTTTGTAAAAGGTGTGTGAATTTTGAGTCTGAT 

TTACTACATTTTTTTTTAATTTTCTTTTTTTTTGGAGTTTTAGGGATTGC 

TTAGATGGCTAGAAAGATCGCTAGGCACATGTCC 

>Contig45 

GATGTGTGTACGTGTGTGCAAATACCGTGCCTTTTTTGTTTTCTTTTGTT 
GAAACAGAGTCTCACTCTGTCGCCCAGGCTAGAATGTAGTGGCGTGATGT 
CAGCTCACTGCAACCTCCGCCTCCCAGGTTCCAGTGATTCTCCCGCCTCA 
GCCTCCCAAGTAACTGGGATTACAGGCGCCCACCAACACGCCCAGCTAAT 
TTTTGTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGGTC 
TCTAACTCCTGACCTCGAGATCCACCCACCTCGACCTCCCAAAGTGCTGG 
GATTACAGGCATGAGCCACCATGCCTGGCCAATACTGTGCCATTTTATTA 
TCAGGGACTTGAGCATCCATGGATTTTGGCATCCATAGGGGTCCTGTAAC 
CAATACTGCACAAATACOVAGGGACAACTGTATTCTAAAAAGACCAAAAA 
TTAATAAGCAGGACGCTGAAGGTAATTGCCCCAATAAAGTCATGATCCCT 
TGCCCAGTGTCTGAACCTCAGCCAGTTTTCATACTCAGGACCTATTGGCT 
GCAGAGGTGGTAGGAACCATATGAGAATCCTGCAATATCATGGCAAGTAT 
GCACTTTAATGATATCTGCAGTCCTTCCCCAAAAGGACCTTACATTTACC 
ATACTGC7ATGTCCTGCGTGAGAGGGTAATACTCAGATTTTTTTTTTTTT 
TT7TTTTACACAACGTCTTACTGTGTTGCCCACACTGGAGTGCACTGGCT 
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CGATCTTAGCTCACTGC .CTTCTGTT1 . JTGGGCTCAAGTGATTO& 

GCCTCAGTTTCCTGAGTAGCTGGGATTACAGGCGCCCGCCACCATGCCTG 

GCTAATTTTTGTATTTTTAGTAGAGACGGAGTT7TGCCATGTTGGCCAGG 

CTGGTCTTGAACTCCTGACCTCATGTGATCCGC7GGCCTCCCAAAGTGCT 

GAGATTCCAGCGTGCGCGGCCATACCCGGCCGGGAATTCTTTATATATTC 

TGAAAACTAATCCTTTGTGAGACATAAGTGTTGTAAATATTGTATCCCAG 

TTTGTGGCATGTATTTTTAATTTTTAATGGTGTCTCTCAATGAAAAAAGC 

TTAACACTTAAATGAGGTCAAATTGATCACCTTTTTATTTATGGTTGAT T 

CCTTTGGTGTCATGTGTAAGGAATGTTGTTCCTTCCTGTCCCAAAGTTGC 

AAAGATTTCTTGTGTATTTTGTCCTAAAAGTTTTAAAGTTTTGCTTTTCC 

CATCTGTGCACATTTCACATTTGCTACATCTCACTGACTGCTTCCTCTGC 

TGCAGAGCAAGCTCCATGAGAGCAGGAGGCATGGGTCCTGCTTCTTGTTG 

GTCCCCAGAGCCCTATGTCATGACTAGGACCTGGCAGGGGACTAGTGAGT 

AGCTCCTGACTAACTGACTCAATGAATGAATGATTGGATGATTGAACAAA 

GTGGTATGGGAGTTCACAGCGAGTAAGAGATGCCTTAGAAGAGATGAAGA 

AGGAGATGGTATAGGGTAGTGGTTCTCAATTCTGGGTCCATGGTGGACTC 

ACCTGGGGACCCTTAAAATGTACCGTGGAGGATCCCAGCCCAAGAGATTC 

TGTATGACTGGTCTAAGATGTGGTCTGGGCACCAGGTGATCCCAGTGTGC 

AGCCAGGCCTGAGGCCACTGGATTTGGTGGTAAATGAGGTAACTATCAAG 

GGTACAGACGTTGGTTGCCAACAGGCTTGGGCTTGAATTTAAGCTTTGTC 

ACTGACTTGCTGTGTCCTCCTGCACTCGTTGAGCCTGTTTTCTCAGCTGA 

GAGATGGGTGTGATAACACCTACCTGCTGTAGTTGTTGTGAGAGTTAGAG 

GAGATAAGCATGTTCCTGGAATGAAGTGTGTTCTTAATCCATCATAGGTT 

TTTTGCTTGTTTGTTTGTTTGTTTGTTTGTTTTTTCCTTTTCAAGAATGA 

GGTTGAGCCAGACTTTGACAGCTGGGTGGGAAGTGAACATGTGGTGATTG 

GGAGAGAAGGGCAGTTTATGTGAAGGGAATGTAATAATTAGAGAGTGGGC 

GTGGGAAGACATGCTGGGGAGAGTGAGCAGGCCGGTTAGCCCTGGTAGAG 

GGTGCAAGAGAGCAGTGCGGAATCTGCCAGGGAGACAGGTGGGTGACCAG 

GGTGCCAAGGGTGTGGCTTTTCCCAGGTTCCCATGGACACAGCCATCCTC 

CCAGATGCCCAGCCTAGCTGTGAGTGAGCAAGAGTTCTGGATTGTCTCTC 

TCACTCTGTCTTTTTCTCTCATTCCAGAAAC^AAGGAGTGACTGGTAC 

AGGAGGAGAATCAGGTCAAGTTGGGAGAAACTTGCTTCTGCTCAGGGGAG 

CAGAAGCAAGAATGGAGGCCCCACCCATGCTGGAAGATGATGAGGGTTTT 

GGTTCAGGGAGGAGGAATATTGGGGATCTAAAGGGGCCTGGGAGTGGGGC 

AGGACCCTGCCTTAGGACAGGTAGAAACATTTTCTATAAAAAATGGGGTG 

GAGGTTGATGGTAGGACCAGGCATCTTTAGTTGGCTCCCTGGAGTGTCAA 

GCCCTTGAGATGGTCTTTAAAAGCCATGCAGTGGGGTTTGAATCTGGTGT 

TCAAGCTCATAGGTTATTAACATAATGACACTTGGAAACTATTTGGGAGA 

GCTCAAGTGAGTGGCCTGGAAGTTCTGTGTTGGTGCAGGAGGTGACTTAG 

GATGTGCTGCTCCAGACTCATATCTTTGACTGCACACCTGATGCTTCATC 

TGGCTATCCTGTAAGCACCTTCAACTTAACATGTCCTACACAGAACTCTT 

GATATTCCTGTTCCTCCCCCAGTTCCTCAGTTCTTACCAAATGTTCTTCC 

AGTTACCCAATTGCTCAAGTAAAAAATCTAAGTCCTTCTCTTGGATTTCT 

GCCTGTTCCCTCAACATCCCACCTATCCATGAGTGTTCTGTGGGCCCTGC 

CTCTGAAATAAATCCTGCCTTTGTCTCCCAGTTCACTCCAGCCACCCATC 

CTGGGGCTGCACCCTCCTCCTTCCZAAGCCCTCTCCCTTTCCTTCCTGGTG 

CTGCCTGTCATGTCAAGCATATGCATCAGTGCGACCAGGACATTTGAAAT 

GCAACCAGTACAATTGGGCGCGGTTATGCCTACCAGTTTTTCTTCCTTAA 

ACATTTTATATTTATGTTTGAAAGCATGCCACCTTTCTTCACTTGCCAAC 

TTGACAGATTTATTAGTTGACAACATCCGCTGATAGCATCAGTAATAAGT 

TAATTGTTTTTGCACATGTAGCTTTAATTATTCTCATTATCATTTATAGG 

AGTTATTCTTTGTAAAGGGTAACTGAGTTTTCCAAAACAAACAGAAATTT 

GGGGTGGGCCCATGGAGCGTGACTCATGAAATCAGATTCTTAGAAGGACC 

TCGGCAAGTCTCTGGGTTGCTGTTAATGAGCCTGGCTGGCTGCCAGGGGT 

GTGTCTGCCCTTTATGAGGCCACCACTGTTCAAATGCTTGCCTGCAGCAT 

TACTTGCCTAGGTAGTGCTTGTTTCTACTGAACTGTCAGGGATCCAATTC 

TTTGTGGTCTAAGTAACAATACTCAGATTCACAAGGAATTGATTAATAAG 

CCAGAATGCCAATGTATTACATTTTTGATGAAGACCATATTTACAGTGAT 

TGTATCTGCTCAAGCTCAAATTAGGATTAGAGTTC7GACAAATACATATG 

TGAGAAG7ATGAGGTTAAATACTTGAAATTTGGACTTTTCTAGAAAATCT 
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GAATGTGATTGCCATTCACATACCTTTCTGGGGATGATGATTCTTGTACT 
TTTATTTTAAAAGACATAGAAAACTAACTTAAGAATCAGATTGCTTGGCT 
GGGCACAGTGGCTCATGCCTGTAATGCCAGCACTTTGGGAGGCCAAGGTG 
AGTGGATTGCTTGAGCTCAGGAGTTTGAGATCAGCCTGGGCAACATGGTG 
AAATCCCATCTCTACCAAAAATACAAAAAAAAAAAAAAAAACAACCAAAA 
AGAATAAATTAGCTAGGTGTGATGGTGCGTGCTTGTAGTTCCAGCTACTT 
GGGAGGATGAGGTGGAAGAATTGCTTGAGCCCAGGAGGTGGAGGTTTCAG 
TGAGCTGGGGTTGCAACAGTGTACTCCAGCCTGGGCGATAGAGTGAGACT 
CCGTCTCAAAAAAAAAAAAATCAGATTGCTTTATTGCTGGTTTTCTTTCT 
AAAACTGAGATTGGGTCCCATCATCCCCTGGCCCCCATTGGTTAATGGTT 
CCTCCTTTGTCTATTGAATAAAATACAGATGTCTGCTTTTGGCAACATGG 
TTGAATGTAGACACTGCAGGGTCTTCCTGACTCAAAATGATTTAGGCTTA 
GATAAAACACATTTGGAAATGCATTTCTGGATTAACACCAAGGAAAGGAG 
ATCTCTTTAAATCCCCTTTCTGTTCCCCCCTCCCTACCCCCTCCAATTGG 
GCTTAAGTAAGAAGGGTGGTTACCCGCTAGTAAACCCCCTTCGAAGGGGG 
TCTTCTCCTCTAAGGGAAAACCCTTGTTTTGACATTTGCTTC3CATGGGCC 
CTTGTATTTTGTTCCTTGCTAAACGGGTGCTAAACCAGGGGCCTCCTCTT 

>Concig46 

AAGGCTTTTAGAATATTTGCACACTTTAGAAATGGAAATGTTTTTQGGGG 
GCGAGTTGTCTTAATATTTCATTTTTCTAGCTTGTGTGACATCCTTTTGA 
AAGCAGCAATTCTGGCCTTTGTGAGAGATGGTGAATGCCTGCAGGTGTGT 
GGACCAGTGCGTCCCTTCCTTCCTACATGCACGGCCCCCAGCTGGGCCCA 
GCAGAGTGCTGTTACAGAATAATTTCCAAGGGCTGTGTCTCTAACCTTTG 
GTCTTGTCCCCCATTGCTGTAGATTTGGCCAATTGACTTCATAAGTGCCT 
CTTATGAACATAGATGTTGGCAATGGAAGTTGAGGACCAGTCAGTGGTTG 
TTTTATTGAACACACAGCGTAAATCCCAACACAATGCTGACCTAAGAGAA 
TTCCAGCCACTCTGATTCTCAGTCTCTTTATATCTGAAAGGGTTCTGTTC 
CACTTTTTCCCAGATCAAAATGTCCCTGCAGCTACTCAGCAGAGCTGTCG 
CAACTTATACGTAGAAGAGGTAACAGTCCACAAACAGAAAGGCACAGGAC 
GAGAGTGGTCTGGGTGATGCTTCCTGTGGGGGAAAAGGTGATGAGGGTGC 
ATCTGCACACCTATGTTCATAGGTAAGTCTGGGAGGAGGTGACCTCCCCT 
TTGGTTGAGGTGCTGAGGCGTCTTGTTAGAATGGCACTATTCCATTTATC 
TGATGCAGTCTGTGGGAATTTTGTGGTATGGCCACCACAGGTACCATGCT 
GGGAACAATGCCAGATACTGCCTGCTAAGCCACAGCATGAGTCACATGAG 
CATTTGTGGGCTTTGGGAACTAAAGTTATTGAACGATAGTTATCTGAAAA 
GGAATTTAGGGAAAGGGGACTTTAGTCCAGCGAACAGTTTGCAAACCAGG 
GGGAAGGCAGCCTTCAGCGTAAAATGAAGACGTGTGTGCCCCAAATAACA 
AAGGGAGAGTTTGTCTTTTAGAGAGTAAATGTCCACGCAAGGTTCCACTT 
AGGCAAATGAAAGATGCAAACTTGCTTAGTTCTGATTTGTTTACATTTGC 
TGAATTCGGATTGGTCCGTGCZAGGCTTTTCTGGGAACTCCAAATACATGT 
ATGACCTCTAGTCATACATGGCAAATGGCCGCTTGGCTCTAATTTGAATT 
TAGGCCCAGTTAGTCACTCAGGATTAACCTTTTTCAGGGTTCACAGCTCT 
GAACAATGGACTTAGACCTGCAGGACATAATCTGTTCCTAACTCTGGGAC 
TACCTGTGCCTTTTGACTGTGCCCAGTGAGCAGCTGTGGCTCTGGGCCCA 
GACCCACAGGGCGATAAGGCACAGAGGTACGCATGGAGCAGGCTGTCCTT 
GCTGAGTGATCATGAAGATACACTTACATAGAGCAGCACTTTTCCTTCCA 
GTCTTTGTGATTTAACTCATTAGATCCTTATAACAAGAGTCAGTCCTCTA 
TTTAACCCATGAAGCACAGGTGGAGTCCAAGCTTAGTTTGTGAAGGATGA 
GCCAAAAGGATTCTTCTCTTGTAGACCTCAAGCTCAGCTCTCTCCATGGG 
CCCTGGAGTAGGTGAGAAGGCCTCTGTCTTCCAGAGCCCACTGCCAATCA 
TCTACATTTTCTGTTAGCCCAATTCTAGGACATTGCTTTACCAACTGAAG 
GGTGAGAACTATCATAAGTTATAAAAATCAATTGAAAAACAAAAAGGTAC 
AGAACAGAAAATT^AAAGATGAGAATCTATTAAACATAGTGATGTTACTGG 
AAAAGGGGGTCTCAAACCAGACCCCAAGAGAGAGTCCTTGGATTTCACAC 
AGGAAAGAACTCAAGGTGAGTTGCAGGGTGCGGTGAATTGAGAGAGTTTA 
TTGAAAGCTATTCCATTACAAAGTAGAGCATCCTCAGACAGCAAGTGGAG 
GAACATGCCATCATTAAATTTTTCTTATATAGGAATCTTGTCTATATAAA 
GACTAAACTAAGCTGTGGCTATGTGTGGGTGGGCCGACAGCATGAAAACA 
TTTATTCTCCTATTGATTTAAAGAGAACTATCCTTGACATTTTAGTGTGT 
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TAGGGATTGGGACATCTGGGCTTTCTGTTGTTGTAGAAGTTTGTCCTTGC 
AGGGATTACCAAGCCACTTCCTTAGCTGTAAACATCTTAGGGCCATGGGT 
CCTGACTGGCAAGGAATGTGTCTTGCTAGTTTTAAGATGGGC7TGATTTG 
AAAATGGTGTCCATCTGGCTCTCCTAGGCTCCTGCTTTCCTAACAGTAAG 
GGTAAATGCTATGTTATGAAATGTCATTTCTGCCTTTAGCTTGCAAACTC 
TTGATGGTGAAATTCTCCTGTCCGTTTTCAGTGGGGTATTTATTCTGCAT 
CCACGTCTTCACAAGGAGCTGAAAACAAATTGGATGGAAGCAACTGGGTT 
TTATGGGACACGTTAATGTTTTAATGTCATTTGGTGTGGAATTCAGATGT 
CCAAG CAACATTTTACACTACAAATCTGCAACTTTAATAAT CACT CAAAG 
TACCTGAACCTCAATGCTTTCAGACAGACTTGGTATAAAGCCACCACCTC 
TTTCTATTATGGCAGCCCTATCCTGAGGACACAAATTTCTGCAGGGCTTC 
TGGCATATCTCTGATTAAACAAATGTCAACAAGGTTAAAACAAATGTCAT 
CTCTGATTTGTTTGTTTTAAAGCCTGGATTTACTCATTGAATATTTCACT 
CCTACTAGCATGTCTTGTAGTAGTTTTCTTCAGGGACCCTAATTATTGCT 
ATTAAAAATATGTGTGCAGCTACATGTTTTTTTTTTATCAATTTGCAATG 
AAAACTTTAATTGAATAATCTATTAGTGTTATTATTTGAAAGTGAAATCT 
TTTCCTTTTGCTTTCTTGTTCTCACACATAGTGCAGACAGTTTCCACACG 
GGCTCATAAAAGGAATGATTCTGCCTTGTGTGAACTTTTTGCCTTTATTG 
TTAATTGCACCATTTTGTGACTGGCTTCTTGACCCTGTTGTAACCAAGCT 
CATAATGTACATTATTTCTTATTTTGCAGTTGTAGACACTTGAGGAAGTT 
CCCATTCTTTGTTTCTTCTTGCTTTTGTTCCCTGTGATAACTTTTTCATG 
CAGACATTTTTTTTTTTTTTTTTTTTGAGACCGAGTCTTGCTCTGTCATC 
CAGGCTGGAGTGCAGTGGCATGATCTTGGCTCACTGCAACCTCTGCCTCC 
CAGGTTCAAGAGATTCTCCTGCTTCAGCCTTTCTAGTAGCTAGGATTGCA 
GGCGTGCACTACCACACCCAGCTAAATTTTTCAAATTAGCCACCCCACCT 
GGCTAATTTTTGTATTTTTAGTAGAGACAGGGTTTCAACCATGTTGGCCA 
GGCTGGTCTCGACCAGGTGATCCACCCGCCTTAGCCTCGCATAGTTGCAG 
GTGCTATTCTGAGCTCAGGGCTCTGGCAGCTACAAGCCCAAGATGCGGTC 
TCCAACATGTGGCCATTCAATGTCATGGCGCCCTCTACTGGTCCTGGGAA 
GCGCAGCTCTGCCAGTAGCTCCAGCAGGGCACAGCTGTTAAGTCGTGATG 
TTCTACAGGTGACCAAAGGGCAATCTCTGGACTCCTTAGCCGCTAGGTCC 
TCTCTGTAGCAGGACCCAGGAGAAGGCAGGGGCTGAGGATGGCTCTCTTA 
GACATTTGTGATGAACCAAACGTGTGCATTCATGAAACTTCTGTGAGCAA 
GCAGGTGAGTAGAGTTGGGTTATAAAAAGTCTTAGGGTCTCACTACAGAG 
ATGGACTTGCTGTGTAGATGGTGCAGAGCCGCTGAAGAGTTCTACTTGGG 
GTAATGGTGTGATTGGGTTTGCGTTTTAGGAAGATTTCTTGGCCAGAATG 
AGGCGGGCAACCCAGAGCAGGGAGTGGCCACATGTGGGTGTGCAGTTATG 
GGCCACTAATCCAGGTGATAAATGGTGTCTCTGAACTTCAGGTGGGGGTG 
CCACATGTCTCCATCTGCTCTGTACCCTTGAGACTGGCCTTATGGGCTGC 
CTTAGTGGTCTGTTGTCCTCTATCTCCTGGTTGGGCTCAGGCAATGGGAG 
ATCAGAGGGAGGAAAGAGAGCTTGGTTAGAGTGCACCCGCGCCCCTTCAG 
GTTGGCAGTGGCCACATTCCCCTATACAGAAGGCCACAGTTTCTGTCAGT 
GGCCCTCCCACAGCCCCAGCTTTCTCAGTGGGCCAGCCACCTCCCCATCC 
CTTGCTCCTCCTCCTCCAGAGAGGGTTGTGGATTTCCACTGTCAGCAGTG 
CCTGGAGCTCCACCATCTCCTGCTGCTTCCTCTGGACCTGCCTGCAGTTT 
TATAAATAACCTTTCCTTACATTACCTCTAGCATGCACCTTTTGTGTGTA 
TACTCTGCCCCCTGTCAGCACATGACTCATGCCAAAGAGTTTGAATTTTT 
TTCTCCAGGCAACGGGAGGTCATTGGAGGATTTTAGACATTGAGAACAGA 
TGTGTATTGTGGAAATATCTGTCTGACTGAAGTGACCAGGATGGTCCAAA 
AGAGCGAGAATTTGAGGCAAGCAAACCATCAGCAGGCCAGCAGCAGAAAT 
CCAGGTCATAAACAGGGAAGCTGAGGCTCACAGGGTTGGATCAGGGAATG 
GGAGAGGGAAG CCAA ACAATTCCATGAGCATGTCAGTTGCACATATGACT 
TGGTAACTATTTTTATTTTTATTTTTATGTTTTGAGACAGAGTCTCGCTC 
TGTCACACAGGCCAGAGTGTAGTGGCATGATCACAGCTCTCTGCAACCTC 
7GCCTCCTAGGTTCAAACAATTCTCCTGCCTCAACCTTCCAGGTAGCTGG 
GACTACAGGTGCG CAC CACT ACAC C CAACTAAGTTGTGTATTTTTAGT AG 

AGATGAGCATTCACGCTGTTGCCTTAGACACGG 
>Contig47 

AATATTGATTATTTGACCAGAAATTCATGCAGCTAACCGTGACCCCTGGC 



FIG. 4 (27 of 61) 

filar' 



WO 99/06426 



PCT/US98/16102 



XAAATAAAATAGTGTAT^^GTACGTGCAlATAC^TGCAAAGAAATbAGV - 

GAAACTAGAAGGATGTCAATCAAATGATAACATGGTCATCTTGGGGTCGG 

AGTACATTTGGGGATGAGGGGAGCTGTAAAAGCAGACTTGGACCTTTTCT 

TCTACCAGTACCGTGTCATTTGAATTTTGGAAAGAAAAAAAAAAACTCAG 

AAGGAGGAGAAGGAGCAGGAGGAGAAGAAGATGGATCTTAAGTGATTTGC 

CCGGGAGCACCTTGAGAAGGTGAGATTCAAGTCTAGGTCTAAGCTTTCTA 

ATTCCATGAGTGGGAGTGACCCACGTCCAAGAGGAAGCTCAAAAGGAAGA 

TGTTCTCCATCATCTCTTGCTCATCCTAACAGCATGCAAAACCACATCCA 

ATGCAGCTCAGAAAACTCCCAAATTGCCAAATTTCATTGGAAACACTTAA 

TGCTGTGGTTTCCAATTTCAACTGTAAAGTAGGTATGTATGCCATTGTTA 

CCATTAACTTCTCAGAAATGGAGAGAGCTCTCTTTCCGCCTCCTCCCCCT 

GTGCTGTGGCTTTGGTGAGACGTGCACTCAGGCTCACCTGTCTCCATGAT 

CTCCAGTAAGTACACATGAGCAGAGAGGCCTCAGCTCAGCTCTTCCTGGT 

CCCACCAGGGTTGATTCTTTGAGAATTCTAGAATGCCACATCCTAGGCCC 

CCCAAAGAAATCCTGCATCTTACCCCCAGAAATATGAATCATAGCAAATT 

TCAAATCAACCATCGTTTAATACTCACAGACTGGGCACATCC^AAAACAT 

ATTTTCAGTTTTACAACAGTGCCTGGTGCATATCGGCACTATTTGTGGAA 

GCAATAAATCGACACGGAGCTGAAACACAAACAAATGCCAAATTGTTTTT 

ATAACACCTGATTTTCITTCTGTTTCTTTATGCAGTTTAGTTTTGTTTTG 

CTTAACTCTACCTCAGACCATAGTCTGGTAAACTCACCACCCAGAAGCTC 

CCTTGAAATGTGGGTATGCAGCCACTAGGTGGCAGGAGAGAGTTTCCTGC 

CTGGAGGGAGGACAGCCACTCTGTCCCCGGGTCAGGCCAGGGCCACCCTG 

CTACCTGCAAAATTAGCATGGGGCTTTATGAACCACAGCTTCCTAATAAA 

CACAGGATCTGTTTGATAGAGACTCCAAAACACGCCTACCTAGTGATGAA 

AGACTCAACTTCAGAAGAAAACCTTCATGGCAAACATCTTCAGAGATGTT 

TCCAACTTAAGGTTCTGAACACAGACGCTTCCCCAGAAAGCCATTGTTTC 

TCAGCACCTGGGAGCCTTGCTTTGCTTTGCTTACAGACTCGCTGTTCTTA 

AATCACTGCCAAGATAACATCTGTCTCTTCTCTTACCCTCTATTTCGATA 

TAAGGACTCCTCZACTCTTGTTGCTTCCTATTGGCTACCTCTCCACAGGGA 

GAAATCGCTGATTTAACAGCAGTCAATATCCCAAATCTGGAACAGGGAAC 

AGGGAAGCATTTAAAAATTGGAGAATTTAGGCCGGGCACAGTGGCTCATG 

CCTGTAATCTCAGCACTTTGGGAGGTCGACGTGGATGGATCACTTAGGAG 

TTCG AG AC C AAG C CTGGGCAACATGGCGAAAC CTCAT CTCTACAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAACCCAAAAATTAGCCGGGCATGGTA 

GTGCACACCTGTGAGCCCCAGCTACTCAGGAGGCTGAGGTGGCAAGACTG 

CTTGAGCCCTGAGGTCGAGGCTGCAGTGAGCCGAGATCACACCACTGCAC 

TTCAGCCTGGGCAACAGAGTGAGACCTTGTCCCAGATAAATAAATTAAAT 

TAATTTAATTAGAGGATTTAAGGATTTTCCCTACAGACACCTCCTTATTT 

TCTCTGGCCTTTTCTGACTACTCTCCCTAACTCCCTGCTCCTCTGGTCTC 

CCAAAACTACTCCAGAAAAAAAAAAGGGGGGGAGGGACTAAAGGAAAGCC 

AGGTGACAGTGCCAGTGTGACAGATGACAAAGCATCTGCCCGAACAAACC 

GTAGGTCCCTGAACTTTCTCCAAGACCTGTCTGTGGACTTACCTATGAAA 

ACCAGTTTTAGCAAAAACCCTCCTAAGCCAGTTTATCAAGATCCCCTTAT 

CCTCAATATCCATCTGATTGGATTCTTCATCCCCCACCATTCCCCAGTGA 

TGTCACCAGGCCTTTCTTCAGCAACAGTAGTTAGTGGGTGTAGCCAGGAC 

GCCCCCTCACCCCTGATATGCCCTTTTAGTAATTCTTCATCCACAGGTTC 

CCACCCTGCTCCTAGGCTATACATTCCCATTTGCCCATGCTGCATTCGGA 

ATTGAGCCCAGTTCTATACTGAGGTCTTACTTCACCTCTCGCCATAGTCC 

TGAATAAAATTGGTTTTCACATTTAAAAACTGTCCAGCTCTGGTTGTTCC 

TTGACACAGGGTAATTTTTATTCCATGTGATAGTTTGCCTTACCTCAGCC 

TACACCCCTCAAACCTGCAACTCTATATTCAAGAACCAGACAGCCCTTTC 

CAACAGATAGGAAGAGGCTGCCCTGGTGCAAAGGAAGAGGCTCTGGGAGG 

AAGG AGAGAAC C CGAAGGCTGC C CC CTCCT CTAGACTGAGCT CTGGGATG 

GGTGGACGATAAAACCCAGATACGTTTAGACATCTGAGCGTGGAGAGGAC 

TTTGCTTTGCTTCCACAGGGACCCCAAGGAAACTGCAAGCCCTCCAGAGA 

CTAAAAACAGCAGAACAGCAAGAAATGGCAGCAAAGGTCTGGGCAGAATC 

ATCCTATGTGGGCACAGACACAAACAGAGTCCCCTGTGGCCCCAGGAGAG 

TTTAAAGAAGATCCAGAGGCTGTCCTATTCCATATCTCAGCAGAGACAGG 

CCCG7GAGCCTAAAAGCTGATCATTAGGACAAGAAGGACACGAACTGTCC 

TGCAGCGTGAACCGCGTGGAACAAGGCCAATCACCAGACACCAGACCAGC 
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CAGACACAGCCCCGCAGTTCCCCAAGACCACCACGGACCCATCGCCCCTC 

ACCAATAGCTCCAGGCTACATAGACCCCCTCCACTTCATGGATGTCCTCA 

GAGCAGAAAGGGGAGGCAGGAGTGGAACCCTGACTTGGTTCAGTTGAAAC 

ATAAAATGACTGTACTATTATTGAATTGCTGAAGTTTACGTGAAAGAAAT 

GAGATTTAGTTTTTGGCCZACAGTGCAAAATAAGAAACGAGGCTTCAACTG 

AGATTAAGGTGAGTTATAGGAAAATGTACTCCCTTGAAGGACCTGTGAAG 

TGTCTTCGCTATGAGAAAATGACCAGAATCCACGTTCTTAGCTGCGGGAC 

TCAGGCTGACTCCTGTTTCTGGAGCTTGCACAAAGGGCAGGGAAATCCCT 

GTTTCAGGCACAGTGATTTCAATGTTTAAAAGAAAACAGGTGGGCCCTGG 

CAATCATGATAACATGTCATAAGTTTACATCTCTGTGAGGCAGGTAGTGT 

AATCCCCATTTTGCAAAGGAGfGAAACCGAGGCTGAAAGCAGCTACATGGT 

CTCTTCAATGTGGCCCAAATGTTGGAGAACAGAGCTTAACTGAATCAGCA 

ATTCTATACTTAGAACTGACTCTCTCTTTATTATATCTCACTACTACCTT 

GATATTTGAAATATTCAACTTTTTTCAATCAAAAAATAACAATAATTTAG 

GCATAATGACTACTATGTCATTTAATTTCTTGCTGATATTTCAATATCCC 

ATGCCAGGAATATTGAAAGCTCAGCTCCTTAAGAGCTGACTATGGCATCA 

ACTCCCAACAACCATCCTTCCAGAAATATTTTCCCCTTTCTTTTGTTATA 

GAGTGGCACTGCCCTATATGGTGACCACTTGCCACATGTGGCTGTTGAAC 

ACTTGAAATTGGCTTGTCAGAATTGCAGTGTAAAGTGTAAAACACATACC 

AAATTTCAAAGACATGGCACATAATAAAAAATGTAAAATATCTCATTAAC 

AATTTTTATATTGACTGTGTAAGTAACATTTTGAATATATTGGATTXAAT 

ACATGGATGATGCCCCAACACCCACAGTCCCTTATCAAGTCTCTACTTCA 

CATTTTTGTACTTCTGACTTAGAAATAGCACTGGCGTCTAAGAGCCTATT 

AATGTCGTCAATAGGTTCTTGGGAACCACAATTTTAAACAAAATGACATA 

TAAGAAAACGAATAACATTGAACAAAATGACATTATTCGAGGACCTGCTG 

CATGTTGTTTCACTTAAAGTCAGTGTCCAAGAACCTATCAGTGACATTTA 

GTGAGGACTTGCTGTCCTTCCTGTTTACAGGAACCTGGGCAAGTTACTTA 

ATTCCTCTAAGCCTGGTTTATATCCCTGCAAAGAGAGAAGGATAATAATC 

ACCAGTACTTAGTGATGTCGTAAGGAGAAAATAAAATAATAAATATGAAA 

TGGCTGACAGTGTCCTTGTCACACAGAAGATGTGTGATCCACAGTAGCTG 

CTATTGTCTGCCTCACTTCACTAGTAATGGTCCAGGGAGGCCTTTAATGT 

GCATGGTGCAGTACATTCACATGTTGGACATGGGTGAAGGGAAAGACCAG 

GCTCATCTAAACACAATAGGATGCTTGTGGTGTTTTGAGGAGGAATCAAG 

GACTAGTTATCCACAGCTGTAACATGCATGGATCAAAAGAGATAAGGCAC 

ACAAAAGACTTTGTCAGTAGCAAAGCATTACAAAATGCAGAGACCAGCTG 

TGGGTGGTGGTGAGTCAGACCCAGCTTCCCTCTGTGCCTGGCTGAGTGGT 

TCTGGGCAAGTCACGCCATCTGTCTTGATGCCCTTCCCCATCTATAGAGA 

GGGAGCAACTGAGGCCCCTTCCAATACTGAAGTCCTTTATTTCTGCTACT 

TTAGAAATATCCACATTTTTGGTAAATTCAAATGATCCAATGATTCCATT 

TCCTAATGTTCAAAACTAGCCCCAGAAACATCTAAATGAATCAAACAAAT 

AAAATATTTATTGTGTATGTTTTGATTGCTGAAACTTCTATTTTAGCAAC 

ACACACACACACACACAGAAC C CATAAGC CTTCAT CTTTC CTTGG ATAAA 

CGAGCCTTCCTGTCTGGCCATTTAAGTCACGATTAAGTAAATGATTTCCA 

ACTCGCCTTTTGCAGCAGTTCAGATGGGTCTTTCCTGCGTGGCAGTGGCC 

CTCCTGACTTATGATTTCCTGTGTGTCGGCeTGTTACCACTGCAGCTTAA 

CTGAGGAAACAAGAACAAAACAGCTTCTGACCCCAAGAGACTGTTGGAGG 

CAAAGGCTTCAGTCCCAAGAACCTCACACGTGGGGAGCCCGAGAGCCCAG 

CCCTGACCTTTTCTCCAGTAATAACATAAGAAACAACAGGCACTGGCCTT 

ATTTTGGATACAAAGAGTGGTGCTTTTCCTTAAATCTTCCTTTAGTCAGG 

GCTACCCCTTCATGGACGCCCCAACATCCATGGTTCCTGCTTGAGTCCCT 

GCTTCCATATTCCTGCACTTCTCACTTGAAATATCCCTGGAGTACGTTAA 

GCAGCCAGGTTTGGAAGTTCTTGCTGTGCAGGCGGGTGTGTGCATGTCCT 

CTCTCTCAACAGGACACAAGCTCCCCAAATCAGACGGTATGCCTCCACGC 

CCCTTCCCAAGCCTCCCCAGCAGCACCGAGCATGTGAGGGGAGCTGGGGC 

CCAGGCCATGATGGGAAGCACTCTCTGCCTAAAGACTAGGGTGATGCGCC 

CTCAACTGTGGGAATGAGCCCCAGCTCTGGTGTCTGCCTCGGTTTTTCCT 

CCTGGACAATCAACATGAACTCCTCACCCCTCTTATCCACTTTGCATAAA 

CTGAAAATAACAAACCCAGGGCTCTTTCTGTCACAGGAAAGGGTTTTTTT 

TTATAAAATTAAACAGAGATGATTCAACACACCCAGGATATAACACATGG 

GCCATGAATCAAGGGCAGCATTGCTCTGGTCAGCCTGTTGTTTGGGCCCC 
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CTTGGCAGGGCTCTCCCCrGAATCTTCCCCTCTTGACTCCC^TC^C^'CA 
GCACTCCANCTTTGTGTTACAGGCGATAAATGGGAAAGGGGTAAAT 
>Concig4 8 

CATTCTTAATTAGAGAAACGCTCATTAAACTAGACACCCAAATTCTCTGG 

GGGGGGATCATTCTTACAAGCATGCCCTTCTCTCTTAAAGAGAGAGCACT 

TTTTTCGCAAATAATGCTGCCATGAACATACGGGGTGCATGTATCTTCGT 

AA7AGAATGATTTCTATTTTGGGGGGTATGTACCCAGCAATAGGATTGCT 

GGGTCAAATGGTATTTCTGGTTCTAGATCTTCGAGATCTTCCACACCGTC 

TTCCACAATGGTTGAACTAATTCACATTCCTACCAACAGTGTGAAAGCAT 

TCCTATTTCTCTGCAACCTCGCC^GCACCTGTTATTTCTTGACTTTTTAA 

TAATCGTCATTCTGACTAGCATGAGAGACAGTATCTCGTTGAGGATTTGA 

TGTGCATTTTGCTAATGATCAGTGATGTTGAGCTTTTTTTCATATGTTTT 

TTGGCTGCAAGAATGTCTTCTTTTGAGAAGTGTCTGTTCATGTCCTTTGC 

CCACTTTTTAATGGGGGTTTGTTTTTTCTTGTAAATTTGTTTAAGCTCCT 

TATAGACTCACAATAACAAAGACATGGGATCAACCTAAATGTCCATCAAT 

GATATAACGGATAAAGAAAATGTGGTACATATATACCATGGAATAGTATG 

CAGCCATAAAAAAGAATGGGATCATATCCTTTGAAAGGACATGGATGAGC 

TGGAAACCATGATCCTCAGCAAACTATGCAAGAACAGAAAACAATTGTTG 

CATGCTCTCACTTATAAGTGGGAGCTGAACACTGAGAACACAGGGACACA 

GAGAGGGGAACAACACACATTTGGGGCCTGTCAGGGGTGAGGTGGGGGAG 

GGAGAGCATTAGGAAAAATAGCTAATGCATGCTGGGCTTAATACCTAGGT 

GATGGGTTGACAGGTGCAGCAAATCACTGTGGCACACATTTACCTATGTA 

ACAAACCTGCACATCCTGCACACGTACCCCAGGACTTCAAAATAAAGAGA 

GACAATAC7TCTCCCTTAAGTGTCTACTGTTGCTTTGCAATAAAAAGTTC 

CTGCC7TTCACTTCACTCTGACTTGTCCCTGAATTCTTTCTCGTGATGGT 

GTCAAGAACGTGGACACTGGCTGGGGCTGGAGACTCACCAGCATCCGGAG 

ACCCTCCTGAGCCCTCCAGCAATACAACTTTGACACAAACTATGAAATCA 

CAGATCCAAGAAGCTCAAAGAACCCAAGCACAGGAAACATGATGAAACTA 

CATGAAGGAACATCAGAATTGAATTGTTCAAAATCAGTGATAAAGAGTAA 

ATCTTAAAAGCAACCAGAACAAAATATCCATCATATACGCAGAAATAAAG 

ATAAGTATGACAGCAGATTTACAAATAGAAAAAAAAACAAGTGCAGCAAC 

AGAAAC^AACTATCAATCCATAATTCTATACCTAGTGAAAATTTCTTTCA 

AAACAAAGGTGAAATAAAAAAATTATTTTCAGGAATACAAAAGCGAAAAA 

ATTAATCACTAGCATTCZATCACTGCAAGAAATGTTAAAGGAAGTCCTTTA 

GGCAGAAAGAAAATGATACAAGGTGAATATTTGGATCCCTGCAAGGAACT 

AAAAAGATCCAGAACTGATAACTTAATGGGTAAACATGTAATTTTCATCA 

ACAAGTGAATGAATAAACAAATCATGATATATCCATATGATAGACTACTA 

CTTAGAATACAAAAGAAGAACTACTTATGCATGTGATAACATGAATGATA 

TTCAAAATTATTATTGAGTGAAAGACACCAGATCAAAACAAAGTACATAC 

TGTATGATTCTGTTTATATAAAACTCTATAAATTGCATGCTCTTCTATAG 

TGACAGAAAGAAGATCAGTGGCTGCCTGCAGACAGGAAGAGATTACAAAC 

GGAAATGAGAATTCCTTAAGAGATGATGGACATGCTCATTACCCATCATA 

TGTATACAGCCATAATGGTTTTACAGATACATATATATGTACACGCCAAC 

ATAAATATAAGTTATCAAATTACAGTAAGTTCTGACTTAATGTCACTAGG 

TTCCTGGAAACTTTGACTTTAAGCAAAATGAXGTACAGTGAAACCAATTT 

TACCATAGGCTAATTGATATAAAGATGAGTTAGGTTTTTGGTTTTTTTTT 

TTTTGACATGAAGTCTCGCTCTATCGCCCAGGCAGGAGAAGAAGAGTTAG 

GTTTTACAGCATGTTTCTGGTCACAAGAACATCATCAAACTTGTAAATAA 

AGGCACAAAACACTTCTAATATTAAATATCAAAATAAATATGAGTTATAC 

AGAATTTAAGAAAGATTAATAAAAACAAGTAAAATCATTATTTATGGGAT 

TTTTGGTAATCAGTGAGTTATGTGGTCATAGTGGAAGTGGGTTAAGTCAA 

GAAATAAATGTTTGCAAAACAAAAATTTTAAAGATCCTCTCCTACCACCA 

CACAAAAAACAAGAAAACACGGTGGGCTCGCTAAGCACTTTTGTACCACT 

CGTATCTTATGCGTTTGTATGATTATTGTAAATGCTTTATGATAATTTTT 

AGAGACAGGGTCTCACTCTGTGTCTCAGGCTGGAGTGAAGTGGTGCAATC 

ATAGCTCACTGCAGTCTCAACCTCCCGGATTCAAGAGATCCTCCCACCTC 

AGCCTCCAGTGTAGCTAGGACTACAGTTGTGTGCCACCATGCCCATCTAT 

CTTCTTTTTTATTTTTTGTAGAGACAGGGGTTGTGCTTTGTTGCCCAGGC 

TAG7CTTCAACTCCTGGGCTCAAGCAATCCTCCTGCCTCAGCCTCCCAAA 

ATGCTGGGATTTCGGACATGAGCCAGCAGCACCTTGCCCAGCATTTTATT 
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TCATAATAATTATAAGTCATTCCTTCATTCATCTTACAACCCACTTGTTC 
CAGTTCAGGATCTCGGGTQACCAGAACCTATTAACGTTCACGCACAAGTC 
AGAAACCAGCCCTGGACAGGACACCATCCTACCGCAGGGAGAACTTACAC 
ACCCACACTCACTCAGACTGGGACCATGCAAAGAACCTAACGTGCACTTT 
GGAATGTGTGTTCCATACCCACTAGAACAGCTAAAATTTAAAAGACTGAC 
CATACTTGAGTGTTGAACAGGATGTGAdACAACTAAATCTTTTAAGCGCT 
TCTGCGTAAATGGCACZAGCCGCTTTGGAAAACAGTTGGCAGTTTTTCAAG 
TTAAATATACCCAAACTCTATGATCCACTTCTCAACAATCAAACAAGAGA 
AATAAAAGCAATGTCTACACAAAGATGTATACACAAATGTTCATTGCAGC 
CTTAATTATACTAGCCCCAAGTTGAAACAAGCCAAATGTCCATTACCAGA 
TGACTGGAACATACAAATTGTGGTATATTGATACAATGAAATACTACTTA 
GTAATAAAAAAGAAAGAGCTATTAACATAAGCAACAACATGGATGAATCT 
GAAAACAATTATGCTAAGTGAAAACAGCCACACAAAAGTTACATACTGTA 
TGATCACATCTACATAAAATTACAGAAAAGGCAAACTAATCTATAGACAG 
AAAAGCAGATGAGTGGTTACCTAGGGATGGGGCAGAAGGGACGAAAGGAT 
GGATTGCAAAATAGCACAAAAATATTGGAGGGATGACAAATATATTCATT 
ATCTTGATTGTGGGGATAGTTTAATGGGTATATATAGAGATCAAAGCTCA 
TCTAATTATACACTTTAAATATATGTATTTCATTGTGCATCAGTTATTCA 
TCAACAAGACTATAAAATAATATATGCCTACATACATTTTTAAATATTCA 
AAATCTCACAGTTATATACATAAATGCAACTGAATATGTATTCAGATGTT 
TTAACAAGCAGAAAGGACTGATTAAACTCATGACAGCGGCTGTTTCTGGG 
AAGGGTGTAGGAGACAAGAGATGGAAAAGAGGATGAGAGCCAGAAGAGAC 
CCTTGTAATGTTTCCTTTCTTTTAGTAAAAATATATTGACAGTTAAAGCT 
GAGAGGTGAGAATAATAGTCTCATGGCTTTTGTGTCCTTAAAATTTCACA 
AACTAAGTGAAATGGGAGAAAGCAAAAAAATAAACTTAAATAAATGTTAT 
ATTGCCCAAAAAGAGATTTAAAATGGAGGTTAGACACATGAGACTTACGT 
TCTCAAAAAAGTAGAATCTGCAGGGAAGTTTAACAACTATAAAGAATTAA 
AATCTAGCTTCTACCAGCCCAAAGCCTAAAATGTTCTGCTTTATTCTTCC 
TTATTATAATTCATAGGTAATATATTTTATGTTTGCAAATGAATGCAGTG 
ATATTAGATCTCTAAGAGGTGCTAAAAATGAAAAGTAGATATTCCAATTT 
TTCCCAATTTTCCTTCTCTTTCCATGAATGAAAAATATACATATTTGATG 
ATTTCCAAGTTTATACAACCGATCTTTCTCTTAGTTTTCTCTTACCAAAT 
TCCCTCCCTCACTCAGCCACCAGCCAGTCCAACTGTGCTACCTGCACAGC 
AGCCCTCATACCATCCACACTCTCATCAGGATCCTGCCTGACCTGCGAGG 
AGCAGCAGCAAGAAGGAGACAGAACCTCCACGCTGAGCATCTCAGGGCTT 
TCTCAGAGACTCCAGAGGACCCTGATAGGGACAGAGCCTGGCCAGCAATC 
CATGCTGCCAGCTGTATGATTGTGGGCATGTAAATTCTCAACTGAAAATG 
GGTGTAATAATAACATGTTCTTCCCAGAATGAGCTTTATGAAGATCATAT 
AGCTGTTTGGAACTCAGACAAGCACTGGTAGGAATACAAACAGGGGAGCC 
AACAGCCTATAAATAATACTTTAAGAAAGGGCATGAATGTAATTACTTAG 
GAACAAAAGGCAAAGTGGAGAGATGCCTAGGACTGAGCTGGACAAGCTGC 
ACCCTTTAGTGGCTCAGCCCATGGGCrGACAAGGAAAATGGAGGAGCTAC 
CAAAGAAGGTGGAAGGATTCTGGGAGAGTGGCCCTCACCCTGCCCAGGGC 
AGGGCTCAGTGGGAGAGAGGGAGATCTGTTATAAATGCTGCCAGGAGGTC 
GAGTCATGTGAGAATGTCCATGTGAAAACATCCACTGTGTGTATCTAAAG 
AGAGTGGCTGTAAAACAGGTCAGGGTCAAAGGTCTTATTGTCTCAGATGT 
TATCTGCATGCATTGTCTCACGACCAAGAAAACTAAGGAGCATGGACACA 
AAGGGTTAGGTTGAAGCAAAAATTTAATAAGTGAAAGAAGAAGGCTCTCT 
GCAGTGGAGAGGGGAGTCTGAGTGGGTTGCCACTTTGACAGCTGAATCCA 
AAAGCTTTTATAAGAAACTCTTCTCATATCTGCAGCTGTTTGAGTAACTT 
CTCTTACCTATAAAACTGTCTGTATAACTCTCCCTTATCTATGCAGCTGT 
GGGATGTCT CCAG GTAAGCATAAAGTGTAGCTTCTCTTGTTTGTATAACT 
GTGGGTTTGTTTTAGGCAAGCCCCCATCCCCTCCCTGTGTAAGCTCCCAT 
GGAGCCCACCATGTGCATATCTGAGAAGTGGAGGAAGCTTTCTCTGGGAG 
CTCACTGATCGTACAAAGAACAAGAGGCTTCTGTGCCGCTTATCTATTCA 
GGTGCAGCCTGAGTTTTCCCCAGGCTGCTCTATTTTTGCCTGTAGCTATG 
ATTTTTCAGGCAGGCTGCTTCTCTGAAGACTAGCCTTAACTGTCTACCTA 
TCAGATTTTTCCTTTTCTTCTCCCTCAGCTGGTTCCCCTCACCAAGGCTG 
AGCAAGTGAAAAGGAGGGCACAGGGCAGGCCAGTAGTGAGCAGCAACAAG 
GAACTAAGACA5CAGAAACCACTCTTCACACCTGGGTTGAAAGGGGTGGG 
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GAGCCAGGACTACAGCluAGGTAAGAACATAGGTAAAGAGATACTGTTGT 

TGTGTTGTTTTTAACTATGAGAAGCATTGAGCTTTAAATTTCTACAGGAA 

GGATCCAGTTCAGACAGGAGCACCCAATATTCAGAAGAGAAGAACATGGT 

GTAAAGGTCCTGGGAAGGCTGAGAGGATTGGGACTCAGAATCCAGAGCAG 

AAGCCGTCTGTGAACAGAAGAAGGACCTCCCCCAGTGTAGCAAGAGGGAG 

GGAGGAGGGACAGATGCCAAGATGGTTCAGGAAGAAGGTTTGGTGGTAAA 

TGJGAGGCTGTGCTCACCTGCTGGCTTCAATTTTCTCTTTAAAATGTCAG 

ATGGAATCATTTGATGAAGGCCATGCCATGCAATGAAATGGCAGTCTGAG 

GCATGGAGCAGCTCCAGCTTAGCCCGTGTTTAGGGTAATTATGGCTCCAA 

CCCAGGAGATGAATATGACTAGGGAAAGTGAAGTCCAAAAACAAATGGTC 

TCAAGTTGACTGTGAGTCTTCTGGGAGGCTGAGACGACAGGTGGGGTTGA 

CAAGGGAAGGGGAACCCACCTGCTGAAAAACATCAGGCTGTTGGCTGGGG 

GAGGGGTGAGGCCTGTGTTGTAGAGATGGATGGATGCCTAAAGTTGGGTA 

AAGGTTTCAACTCTACCCTCTGCTGGGTGTGGAAATAAACAAAGACCACC 

CAAATGAGAACAAACAAAGACTATTTATCCAGAGCTTGCTCTGACAAGGG 

AGTCGGCAACCATCACTTGCTTGGCAGAGACTCAGAAGTAAGCAGGGGAG 

AAAGCCTCATAGCAGAAAGAAGGGAAGTCTTCATGTATGCCCTGAGTGGC 

AGCTGTAGATGTGGGTGAGTTGCAGGTGGCTAACTAGAAATGGGGGACTC 

CTGTGTGATTGATTAGGAGCATGTTTGGCTTTCTCTGGTTGGTCCTACAT 

TGGAAGAGGGAACAAAAAATTTAGGGCAGTTGTCAGTTATTAATCAAGTG 

TTGGCCATTTTTGACTGACTGTTACAGGAGTGACTGGCTCCCTGGATTGT 

TTGCTAGAAATAGTGGTCTTCACTTCCTGCAAGTCTGACTTTCTGGTAAT 

AGGCTTCCTGGGTTGGCTATTGTGGATAATAAGTGGGTTTCCTGAGCTGA 

TTTCTGCAGATTGTGGATCAGAGTTATTTTATATAAACAGTCTGACCATT 

TTCCACTGGCATATTCCATCTTCCAAGAGCTGGCCAAGCTGCTGTCTTAT 

CTGTCTCCCCCAGCCCCTCCACTCTGGCTGTGAAAATACAAGCCACTAGG 

TGAGGAATGGGGACAATTGAAGACTGAAAGCTTTTCTTTGCTGGGTTCGC 

AGAGCTGAGGAAAGAAATGACAACATCCAAGTGTCTGCCCTGGGCCAGTT 

TTAGGACTGTAGTGGTAATGCAAGGACTGTGTGAGTTTATATTTTCATTT 

GTCTCTCTAACTAAGGTGGAAAAAAAAAAACAGAAAATTGTCTGTCTGCA 

GTCTCTGCAAAAGTCTAACACTGTGCTTCCCAACATTGCAGCCATTAGCC 

ACAGGTGAGTATCAAGCACTTTAAATGAGACTGGTCCAAACTGAGATGTG 

CTCTGAGAATAAAACACACAGCAGATTTCAAAGACCTAGTACATGCCCTG 

ATTTCAAGCTATATTACAAAGCTGTGGTAATCAAAACAGTATGGCATTGG 

GAAAAAAATAGACACATTGGTCAATGTGACAGAATAGAGAGCCCAGAAAT 

AAACCCGTGCATGTATAGTCAACTAATCTTTGACAAGAGTACCAAGAATA 

CACAATGGGGAAAGTCTCTTCAATAAGTGGTGTTGGGAAAACTAGATATC 

CACATGCAAAAGAAAGAAATTAGACCCTTGTATTACACAAAATCTAAAAT 

TAATTCAAAATAGAAAAAGACTTACATGTAAGATCTAAAACCATAAAACT 

CCTAGAAGAAAACATAGGGAAAGAGCTCCTTGACACTGGCATTAGCAGTA 

ATTTTTCAGATATAACATCAAAAGTACAGGCAATGAAAGCAAAAACAAGT 

GAGAGTATATCAAACTAAAAAGTTTCTGCACAGCATAAACAATCAACAGA 

GTAAAGACATGACGTATGGAATGAGAGAAAATATTGACATCTGACAAAGG 

GTTAATATCCAAAATATATAAGTAATTCACACAACTCAGTAACAAAAGCC 

AAATAACCTGACTTTTTTTTAAAATGGGCAAAGTACCTGAATAGGTATTC 

CTCAAAAGAAGACATACAAATGGCCAAGAGATGTATGAAAAGCTGCTTAA 

CATAACTAATCATCAGAGAAATACACAAATCAAAACAAGATATCATCTCA 

CACCTGTTAGAATGGCTATTATTAAAAAATGAGATAAGTGTTGGCCAGGT 

GTGGAGGAAAGGAAACCCTTGTACATTATTCATAGGAATGTAAATTAGTA 

CAGCCATTATGGAGAACAGTATGGAGATTCCCTAACAAAATTAAAAATAG 

AATTACCATATGACCCAGCAATTCCACTTCAAGGAATACATTCAAATACT 

ATCAGTATCTCAATAAGATACTTGCACTCCTATGTTCGTTGCAGCGTTAT 

TCAC CATAGC CAAGATACAGAAACAAGTTAAATGTC CATCAACAGATAAA 

TGGATAAAGAAAATCAGGTACATATATATATACAATGGAATATTATTCAG 

CAAAATCCTGACATCTGAGATAACCTGGATAAACCTGGAGGACATTATGC 

TAAGTAAAATCAAAGCCTGACACAGAAAGACAAATACCACATAATCTCAC 

TTACATATGAAATATGAAAATGTTAATTTTATGGAAACAGAGTAGAATGG 

TAGTTGCCAGAGCCTGAGAGTAGAGAAAATGAGATGCTTGTCAAATCAAA 

TCATCACATTGAATATATATAATCTATTTGTCAATTAAATATTTTAAGAA 

TAAAAAATACCTGGCACCAAAAAAAGAATGCAAAATGTCTCAACAATGTT 
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ATATGTATTGCATTTTG^iAGTGATAATAATTTGAATAT^ 

AATATATTTGAAAAATTAACTTCACCTATTTCTTTCCATTTTTGTTAACA 

TAGGTACAAAAAAAAATTAAAATTACCTATGTGGCTCATGTAGGTGGCTC 

ACATTATACTTTGATGACACTATACAGGCTGGTGACCATATATCTCTTAG 

ACTAGTCTAAGTGATTTAACAGTGGTTCCAGAAAGATCCAGGTTTAACAC 

CAATGAAAGGGCCAGCTGGCTTAGCCCAGCTTGTGTGGGAAATGTTGGGG 

AGXGGTT7AAGACAGGGAAAAGCAAAACTTTTGATGCTATTGACTTTTTG 

AAAAATCTTTTGTGGCTGAAAAAACCAAAACATTATT 

>Contig4 9 

GCTCGAGTGTGTCTCTAAAGCCTTTCCCCCATTGGCTCCACTATACGCAC 

TCTCCTGGTTTCCTCCCCTCTAGCCGCTGTCTTTGGTCTCCTTTCTGATT 

TTGCTGCGTCCTCTGTCCCCTGAATGATTGCTTCTCCACTACGGGGTGAT 

TTTGCTCCCCAGGGGACATTTGGCAATATCTGGAGAGGTCTATGGTTGTG 

TTTGAGGGTGTTGCTACTGCCATCTAGTGGGGAGAGGCTAAAGATGCTGT 

TAATGCCCAGGACAGTCCCCATAACACAGAATTATTCAGCTCAAAATATC 

CATGGTGCCAAGATCAAGAAACCCTGCTCAAATATTAGCATGTGCTGAAG 

GCCCTTCTCTTTCCTTTAGCAATATCTGCCTCCTTAGGGATCTTTTCTAG 

TCTCAGTGGTTTAACATTTAAAATCCCAAATTAGGCAATAAATTGGGCCC 

CAAACTTCGTTAGTATAAAATGTAGAACTGTGTTATTAGAAGGCTAATAA 

AATGACCTGGTGAGCATCTGCAGCTAGCCTCTGAGCAATTCTGGGGACCA 

CGTGCAAGATAAATCCATCTGTTCCCTCTCTGTAATGTGGCGCTACCTTG 

TGGCCGATTTTTCCTCGGGTTAAATATCTCTGGGGATGCAACTTGTCGTG 

GTTAATGGCTGTGTGAGGCCAGCGCGTGGTGATAAAGGAATCAATCAAGA 

CAATATTGAATTTAGAAAGGCAGATTTATTTAGAGAAAAGGAGAGATACG 

TTGCAAGGGAGCAATGGGCAATACAGCAGAGGGAAGGCTGTCTGCAAAGA 

GGCAAGGGCTACGTATGACGTAGGGCTGCTTAGGCTGAATGCTTGCAGAC 

AAGATGCTTGCGTGCAGGTGGGCTGTGAGCTGAGTGCTTGGGTGCTAGTG 

AGCCATTGGCAGCTGACCCTATTTCTTGGAACATTCGCTCCCTGCAAGCA 

TTTTAATGTTAAACCGCCAGGTCAGTTTGAATTTTCTTTTTTCTTTTTTT 

TTTTTTTTTTTTGCCTTTAGTAGGACCTGCCGTTGTGAGACTATCTGAGG 

TAAATTAGACACCCTCCTGGTTTAAGTCACCGCTCCAGTGACTAGGCAGG 

GAGCTCTTCCTTGAAGAGGGTGTGGGCAGTGGGTACTTTGCATGTTGTCC 

ACACCAGGCGAGCTGCTGCTTCAGGGCCTTTGCATTTGCTCTTTTCTTTG 

CCCAAAATGCACTTCTCTCACTGTTCACATGATTTTTCTCCCTCTTTTCC 

TTTTAGTCTTTGCTTAAATATCACCTTCTAGGGAGGCCTTCCCACACCAC 

CTCTTCAAGATTTGAGGGTATGCACCCCCACCCCTAGCCTTCTTATCCCT 

CTCCACTGCTTTCTTCTCAAAGCACTTGTTACGTTCAAATAAAATAGATT 

AGTTACTTTATAGTTCTAATTTTACTATTTTTTGTTTACTTCATCAATAC 

CCATGTAATCTCTGGAAGGAACG7TTCTTTTTGTAGTGTATTTCTAGCAC 

CTAGAACAGTACTTGGCACATGGCAGGTGTTCAAAAGTATTTGTTGATTA 

TTTTCTCAAAGGGCATGGAGTCTTAGAAGTTTGAGAACACAGTTCTAAGC 

ACAGCTGTTTAGAGACTATGGATGATGCTAATGGCTGTATTCCCAGTAGG 

TGGGGCAATTCTCAAATTGACCTGGAATCCTTGAGATCTGGGGACAGTCA 

CCAAGCACTGGGCTCTGTGGGGAGAGATGTGCTGGTTTTTAGAGAGGAGA 

ATAGCATCCTGGGGGACTTGGCCCCAGGGCTTTCCTGTCCCAATCTCTTC 

CCAACTGAGTCCCAGAGGCAGGAGGCCTTGTCTGTAGCTGGTCAGTCCTG 

TAACTGTTTCCCTCCCATCTACACAGATGCAAAGAAGGCTGAGAAAAGCA 

AGCTGTCAGGTGAGCAGGGGCCCTGACTCCTCCCCAGAAGGCACTCAGAA 

CTTCCATAGGGCAACTGGAAAGAAGGTTCTACTTCCTCACCGGCAGCTGT 

TGCTGGGGAAAAAACCAGCCTCAGGCCCTACCCTGTGCTGAGAACCTGAA 

TCCAGTATCAGGTTCTCCAACAAACTTGGATCCAGCTGACCCTCACAAGG 

GGTCAGATGCAACCTTGTAGCATATGGAAAATGGCAGCAAGGTCCTTGTG 

TGGACTATGCCTAGAATCTAAATTAAGACAAGGCCTCAGAGGGGCTAAGT 

GACATCTGTCTCCAAAGTTTCACAGCTAGTGTGTGACTAAATCTTGATTC 

CACCCTCTCAGGTTTTACCATAATCCCAAAAAAGGTTGAAACAAGAAAAG 

TTATCTTTGGGCAATTACCTCTTTCTGTTCCTTGCTTTACCTACTAATGT 

TCTAGGCTCACCCTCTGGTCTGCAATCTCACTGAACTGACAGATCCCTCA 

TGGCCTAAAGGGTTTTCACACTGGGTTGACTAGGCTCTCCCATTGCCTGT 

CCTACTG7CTAAGGCACCTCCTGGGTAGGGTGCCCAGCGTCATTCTGATG 

C7GCCTGACT7TCCTTCCAGCTACTTTTGAAACTTGGTATCCATGGCAGA 
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ggcttaaagggcatgttl.jaggtacttttatttco^ttcccc^Gtggc 
atcaaggaaatcagcatctctggatagctctactaaggcttagttctcat 
tgtccaatctagctcctgggtcatgggaggcattcaggaaatatttgagt 
gtaagagtgagttgctttacctccagaatatccttccaatggctctgaag 
caggctgtggagtcctgctggctgatcacagttcacaggtggctcccaaa 
cctgtggtctacatccatcctttgtcagtgtcactgccattgtcccacaa 
atgtcatttgggcctagcccctgggatagtaatcagtctttacatagata 
tacattgtgctttacatccacagtaattctgagtggaccttaaaataaat 

TCCATGTCAGGTCTCACCAGCCCATGGGTTACAGATGGGGTTACCTTTCA 
GCCTTGTAAGGTGCCCCGTCTTTGAGTGTAGACATGGACTCACAACGAGT 
CCACTCCTGCTGTTCCTCTGCTCTTGCTGAGGCTTCTGCTGCTGCTGCTG 
CTGCTTTGCAGAGGCTGGCCAGCTGTGGTGCCTGAGGCACCTGTGTCTTC 
ACAGCACCAACTTGCATGGTGGCCACGGTGTAGTTGGAAAGGGATGCTTA 
GATGGGAGGCCAATGGGAGCTGCTTCAGGAGGCAAATCCAAGTCACAGAG 
ATCGAGTCACCGAGAGCATAGTAAACTCAAAATCCCTTCTTCTGCTTAAT 
AACTGAGATGCTGTCACTGGGTTAACCTCACCAAGCCTTGTTTTGTCTTC 
ACTTAGAGTGATTTCTGTCTTAGAAGGCTCCTCATATCCTTCTGGGGAAG 
GCTTCTAGTGAGTCCACAGATAGCTGGACCAGGCATGTCCAGAAATAATC 
TGATTCTCACATTTGAGTTAGCCAGCGTTCCCAGCTATATCCCCATTTTG 
TGTCTATATAAGTTACCAAAGCCCACAAGGATATTAGGTGGCTCCTTAGT 
TTGCTTTATGATTATGCCTTGTGTGTGTGTGTGTGTGAGTGTGTACGCCT 
ATGAGGATTCCTTCTCTCCCGTTCTTGCTATGGCTTCTCTTCCCCACTGA 
TGGGCTGTAGTTCCCTGTCCTTTTGACTTTGGGCTTAGTCATGTGACTTT 
TTTGCCAAGGGAATGTGGGCAGAAGTAACTGGGAGCCAGTCCCAAGCTAA 
GGCCTTGGGAAGCATGGTGAGCCTATGCCAGCTCCCTCAGAACTCCTTCC 
CTTGGCCATGAAGAGAGAATAACCTGGATTGTACCTTCAGCCCATGTCCT 
AGAATACAAACATGGAGAATAATGAACTTGACTCAAAGGCTGAAGGGCAG 
CTGAGCCCACATGAGGTCAATTGAACTGCAGCTACCTACAGACCTGAAAG 
TGAAATAAACATGTATAAGTCTCTGACGTTTGGGGTTTGTTTACATAGCA 
TTATTGTAGCAGAAACTTAAATAATACTGGGGGCTAAATATAGTGGACCA 
GTGACAGCACAGAATGGTAAAATGGAGTGATTGTTACTTACATCACAACC 
CTTCATCTCTGTTGATGGACACTAAAATCAAAGTGGCAATTACTCAGAGT 
TGGGAGTCATTGAGTTGCATCATTGTTGTTTAGAATCATTGACAGTTTGA 
GCTCTAAGTGATTACAGAGATGGTTTCCTCAGCTACAGGTAAATAAACAA 
AGGCACAGAGAAGTAAAGTGACTTCTAGAGGGCTTCATTGATATTTAGCA 
GCAGAATCAGAGCTAAACAATGAGTCTCTCATCTCCAGCCTTTCTATTCT 
TGTTTCCTAGGTTGGGATTTTGGGAAATAGTGCAGAGAGATTAGCAGTAG 
TGACATGGAACAATGTGAGCCTCAGCTTCCATCCCTGAGGCTGCCTTCAT 
CTGCCAGGGAAATGTCTCTGTGTGCAGCCTTGCCCTCTGCACACAGTGTG 
TATGGCCACCTGAATAAGTGTCCTTTCATAGCGACTAATGGATTGAAATG 
GGTGCTAGAGCAGTGCTTCTAAAAACTCCATGTATTAATCATCTAGGGGT 
CTTACCAAAAACGCATGCAGATTCTGATTCAGTAGGTCTGGAGTGGGGCT 
TGACATTCTGCACTTGTAACACATGGACCACACTTTGAGTAGCAATGTAT 
TAGATCATTCCAGTGGAAACATGTATGAGTGATGGAATGAACAGATATAA 
TTAATCCAGGTCTGGTAAGTGAGGTACTGATACATATTAAGTTGAAGTGA 
ATTTCACATCAAAAATAATGGTTACACAGTGACTTTTACTGCCCCCAAAT 
TCTTTCCTTTTGAGTGGTTTCAAAGTGAACTGAGCCAGCCAGGTTAAGTC 
CCTG GTTTA GTGTGTGATTAGAAGATTTGATCGAGCTTTCTCCTCCTTCT 
AATTCTTTAAATATGCAATGGCCTTCTAGAAACITGTCTCTCAGGCTCCC 
CATGAGCCACCTGTCTTAATATCTTCCCCCCCAGGACATTTCCTGGGTCA 
AGGA AGG AATCAGGGACTAGGAAAAGTAGAAAGGTTGCCTGACAGTGAGA 
AACTTTTTGCACTCCTATTTGTTCAATTCTAAAATGTGGGTATTGTTGGG 
GCTTCTAATTGGAATCTAACCTGAAATTCAGGCATGTCTAGCTATATATG 
ACCAAGAATTAGGATGAGTTCACTAGAAGCCTATTTTCAGGAGAGCGGTC 
AGTTAAATTGAAGTTTATGGGTTTATGGTAATGGGTTGGGGAGTTTACTT 
CATTAGCAATAGCAACGTTTTTGAATCAGAGAAGTGATTTTGAACACACT 
GTACATAGTTTTCTCACTTAGATTTATCTCTGGGTCAACCCTTGTTGGAC 
CTATATTAGAATCATTTAGTGAAGAAAAGGTGGGTGTCATTAGGAAAAGA 
GCCATTTATTCAAATGTTCTGTTTGACATTAGGGCACTGGCAAGACTA^ 
GAATCAATAGATATTTAAAAACAGCCAGGTGCGGTGGCTCACGCCTGTAA 
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■'TCCCAGCGTGATTTGGG *rTACTTTGGGAGGCTGAAGCGGGTGGATTG ^ 

TGAGCTCAGGAATTCAAGACCAGCCTGGTCAACACGGTGAAACCCTATCT 

CTACTAAAATACAAAAAATTAGCCGGGCATGGTGGCAGGCGCCTATAATC 

C CAGCTACTTGGGAGGCTGAGGCAGGAGAATCGCTTG AACCCAGGAGGCG 

GATGTTGTCATGAGCTGAGATCGCGCCATTGCACTCAAGCCAGGGCAAGA 

ATAACAAGACTCTGTCTCACAACAAACAAGCGAACATACGAAACAAACGT 

AACATCCAAACTAGCAGGTACATGCCGTGCCAGTCATGACCCATGGTCAT 

AAGATGTCTACAGCTCAGGAAGCAGCTGCACAATGCCTGCATAGACAAAC 

TCTTATGAAAGCAGAATGTCCTGATGTCTCCATAACACATAACAGTGTAT 

GCTTTTATTATGGTCATACTCTAGCTGTGATGTACCTACGCTCTAATATG 

CCAACGATAGTTTTCTTTAAATCATCAACATAATAAATGTCATGCTGTCA 

GTCCCCCACATGTAGACATAACTTAGCTGGTACATGGATAAGAAACCTAT 

A TTA GATAACCTTAGGCCAGGTGTGGTGGCTCATGCCTGTAATCCCAGCA 

CTTTGGGGAGGCCGAAGCGGGTGGATCACGAGGTCAGGAGATCGAGACCA 

CCCTGGCTAACACAGTGAAACCCCGTCTCTACTAAAAATACAAAAAAAAA 

TTAACCGGGCATGGTGGCAGGCACCTGTGGTCCCAGCTACTCAGGAAGCT 

GAGGCGGGAGAATGGCGTGAACCCAGGAGGCGGAGGTTGCAGTAAGCCGA 

GATCACACCACTGCACTCCAGCCTGGGGGACAGAGCGCAAGATTTCGTCT 

CCCAACCCAAAAA NCNA NNNNAAATTTGCACCCAAATCTGACTAATTCCA 

GAGCCAATTCCAATTTAGAATCGTTATATCTCCCTGGTGAACTGAAGCTT 

TTATCTTTAAGGAGACACACTCTTTATGTCTACCAATGCTTATTGeCTTA 

AAGTCCACTTTGTCAGATACAGCTGCTTTCTTTTAATTAGTTTTTGTGTG 

GTATATCTCTTTCCATCCTTTTTCTTTCAGCCTTCTCCATTCTTACATTT 

TAGATATATTTCTTTTTTCTTTTTTTTTTGAGAGAGAGTCTCACTCTCTC 

GCCCAGGCTGGAGTAGTGCAATGGCGCGATCTTAGCTCACTGCAACCTCC 

ACCTCCTGGGTTCAAGCAATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGG 

ATTACAGGAGCCCACCACCAAGCCCAGCTAATTTGTTGTATTTTTAGAAG 

AGATGAGGTTTCGCCATGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCA 

GGTCATCCACCCACCTCGGCTTTCCCAAAGTGTTGGTATTACAGGCGCGA 

GCCACCATGCCCAGCTGATTTTAGCTGTATCTCAAAAACAGCATGGGTTC 

TGTTTGCTTTCCTTATTCAGCTTTATAATGTAAATCATTTACATCAAACA 

TCTAATACACCATGGACTGTAAAACACAGCCATATTTTATGTATGAATTA 

AAAAAAAAAACACCACCAATTAGTTCCTGAGACACACACCTTAACAATAT 

CTCTGTGATGTGCATAAATCAATCACATCAGTTTCTCTGCACCTCAAAAT 

TTCTTTCCTC^^TTCTCAGAGATATGGCAATTTCTCTGGTTTTACATTCC 

CAGAAGCAAAGAAAAAGTACACAGCTTCTTCAAGTCATGAGTAGCTTCTT 

TTTTATAGCTCTTGGTGTTTGCAAAAAAGATTGGAATTGCTTCACTAATA 

CTAAATTTTCATTCTGCTGCTCTGTTTCTATGACAAGTCAGAGGGCATCT 

TTTTGAAGACATTCTAAACAGCAATTAAACTCAAAACATGTAATGACAAT 

GACACACAAAACTGAACTGATGACCAAATGAAGAGTTCCAGCCAAGTTGA 

CACAAGCTGGCTGACAGAGCTTGTAATACACACAGCTTGGCATATGCCTC 

GCCATTTCAGAGATGTAAAAATAGGAATAAATGTTTTCCCTTAAATCAAT 

GAAATAGAGCATTTGGACTGAAAATCTACGACZAGTTATAGTGTTTTCTAT 

TCATTATTCTCATTCTGTTTCTTCTCCCCCTTGCTTTCTTTTAGTTTGAA 

TATTTTCTATCA TTT CATTTTTCTTCCTACTAGTTTGAAACTTATGCATT 

TATTTTCTATTTTTTAGCACTTACCTAAAATTACTCTGTAATCCATGGAT 

CCTTAATTTATTTAAAAAACTAATGTTAATGAGTAGCTTTATTTTCCTCC 

CATCTAATTTAAGGCCCACAGAACACCTTCACTTACCTCAATCCTCTCCC 

AACTTACATGCTTTTAATGTCATATATGTTAATACCGTATACTTTTAAAA 

CTTTCTAAAATAGCATTATTTTATAGCATGAGTGTTCATTTACATTTTTG 

CATATATTTAGAATTTTCTTTGCTCTTCGTTTCTTCTTCTATTTATGACT 

CCCCTCTGGGATCATTTTCCTTCTACTTGAAGTACATAGTTTAGAACTGC 

ACTATTCAATACAGTAGCCACTAGCCATGTGTAGCTATTGAAGTTTAAAC 

TAAGTAAAATTGAGTAATATTAAAAACTCAGTTCCTTCATCTCACTAGCC 

ACATTTCAAGTGCTCAGCAGCCACATGTGACTAATGACTACTGTACAGCA 

AACATATAGAAGATTTCCATCATGGCAAAGAGCTCTATTGATAGTGTTCA 

TCCAGAGTTTCTGTTCCAGGACCAAACTGAGGGTTGGGCTGCTATTTCTC 

ATGGCCCAATAACAAGATGCAGATGAGCTGGGGAGGAAGAGAGTTTTTAT 

TTCTGCZAACCAGTTACAGGGAGAAGGCCTGGAAATG^TCACCAGGCCAAC 

TCZAAAATTATGACGTTTTCCAGAGCTTATATACCTTCTAAGCTATATGTC 
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TACGTGTAAGTGTGC^TlCACCTGAAGACGTAAGTGATTAACTTCTT4 , 'iA 

ATCTGTAACTAAGGTCTGAGTCCGGAAGATCTTCCCCTGGAGCCTCAGTA 

AATTTACTTAATCTAAATGGGTCCAGGTGCTGGGGTAATTACCCTTATCT 

TGTCCCCTGCTAAATCATGGAGGTTTGGGGAATTCCCTTTAGAGCACCAT 

TAACC7GTTTGTTGAAGGCCTGGGAATTTCCTCCAAACCCCCATTAAACC 

TGTTTAATCCCAAATTGGTTCCGTTAAAAATTCCCTCCTTAATTTGTCCA 

ATJTTAAAGGCCCAAAAAAGGCTGGGGCZAAACTCCTGAATGGCCTTTGTT 

ACATTCCAACCTTTGTTTAAAAACACCGGTTTTTAATATTTAACTTAACC 

ATTTAATCTCTACTGAAACACTTGTTATATAAATCTGCATTAATGAGAAC 

TGGCCTGCGCCATATCTCCTTCTCAGAATATCTTAGGGTTGTGATCCCCT 

GTGTGAAGAGAATATATCTCTGGAGATCTCAATCTCTCTACCCCAAAAAA 

AATCTCACTCGGAGAAAACTCAGACTCTTATCTCCACAGCGCTATCTCTC 
TCCTCTCC 

>Contig50 

GCTTGTCTAAGATGGTGCTCCTTGTTGCTGTGCCTGCTTTCATCCTGGGA 
TCTCCCTTCACCATCAGGATTGCCTTCACCTCATTCCAGTCTTGGATCTT 
TCTTCTTGTTTCTTGAGTATTTTTTTTTTTTTTTTGCTGCATTCCCTTCA 
GTGGCCTCTTGGGAAAAGATGTGTAGGGAGAAAAATTTTCTTTAGAAACT 
TGCATATCTGACAATATATTTATCCTATCCTGACATTTGGTAGATAGTTC 
AGCTGGGTACAGAATTCTAATTAATTTTCCTTCCTGATTTATAAGACATT 
GCTCCATTTTCTTCTGGCTTCCAATATTGCTGCTGAGAAGTCTGACACCA 
TTCAAATGCCTGATTTTTTCCATGTGATTGTTGTTTTCTGTCTGGAGTGT 
TGTAGGATTGCCTCTTTATCTACAGTGTTC7GAAATTTCATGACGTAGGT 
CTTTCTTCATTCATTATGGTAGACACTCAGTGGGCCATTTAATCGGGAAA 
AACATGTGTTCTTCAAGTTCTACAAACTTTATTACTTCCTTTTTCTTGTG 
TCTTTCTCTGGTCTGTTTTCAGCCCCGAGTCTCTTAGATCTGTCCTCTAA 
TATTCCTATTGACTTTACTTCATTTTCTAAGTCTTTATCCTTTTGCTTTA 
CTTTCCGAGAGACCTGCTTAACCTTATCTCCCAACTCTTTTATTGAATTT 
CATTTCTTTTACTATATATTTTTTACTTTGAATACACCTCTCTCTTCCTC 
ACATTTTCCCCCATAGTATTTTGTCTTCAATTGACAGTTCTACTATCTTA 
TTACTCTGGAGATATTAATAATAGTTTTTAAATTTTTATTTATTTTTATT 
TTCAAAACAGTGTCTTACTCTGTCACTCAGGCTGGAGTGCAGTGGTGTGA 
TCATGGATCACTGCAGCCTTGATCTCTGAGCTCAAGCTATCCTCCTGCTT 
CAGCCTCCCAAGTAGCTGGAACCACAGGCATGTGTCACCATACCCAGCTA 
ATTTTTTTG7TTTTGAGGTGGAGTCTCACTCTGTAGCCCGGTCTGGAGTG 
CAGTGGTGCAATCTGGGCTCACAGCAACCTCTGCCTCCTGGGTCCTGGTT 
CAAGCAATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACAGAAACA 
CACTACCATGCCCAGCTAATTTTTGTATTTTTGTAGAGACAGGTTTCACC 
ATGTTGGGCAGCCTGGGTCTGAACTCCTGACTTGTGATCTGCCCACTTGG 
GCTCCCCAAAGTGTTGGGATTACAGGCGTGAGCCACTGCACCCGGCCACT 
AATTTTTAAATTGTTAATAAAGACGAGGTCTTGCTATGTTGCCCAGTATG 
GTCTTGAACTCGTGGGCTTAAGTAATCTTCTGCCTCAGCCTCCCAAAGTG 
TTGGGATTACAGGTGTGAGCCACTGAATCTGACATTTTTTAAAAGTTTTC 
TTCTCTTTACCAAGTCTTTTTTCCCCTTTCTGCTTTTTTGGGTTGTTTTA 
TTTTGATCTCTATCTTGCTAGAAACTTTCTGGAGACGTTTAGTAATACTA 
GATTTTTGAGAGTGGGCAACTGGAAAGCTGATTGGAAACTCTGAATACAT 
GGGTGAGGCTTGTTGGCTGTGAGTGTCATTGCTTGATGTCCTGGCAAGGC 
CAATGGGTTTGGGACCCCTACTATTAGTATAGGCCTGATTCCCTGGGAAA 
GGCTCTTTTGATCTCCTGCCTGGAGGATAAAGGCCTGGCTACCAGCCTTC 
TGTGTGTAATGTGAGGGAGAAGGGCTGGAGTATTCAACATCATGCTGAAT 
CCTTTCAATGATCATCTTGTTTTTAGTAATCTCCTACCTTAACTCTCTGT 
CTTCTGCTAGTATGGGAAAGATGACCTGAAAATCTAACCATTTATTTTTC 
CCCCATTAATATCATTTTATGATTATTCAGAAGTTAAATAATTGTCATGC 
TGTCCTCCAAAAAGACTGAATCAACTAGCAACAAATAAGAATTTTCTCAC 
AGCTCTGCCAGCATTTTAAAAGAATAGCTTTATTGAGCCCAGGAGGTCAA 
GGCTGCAGTGAGCTGTGATTACACCACTCTACCCCAGCCTGGGTGACAGA 
GCAAAACCCTGTCTCAAAAAAGAAATTTAAGGAACAGCTTTATTGTTGTA 
AAATAGACATACAATAAACAGAGCACATATTTAAATTGTGCAACTTATAC 
TTTGATATAACCCTGTGAAAACATCACCACAATCAAGATAGTGAATATAT 
7TATCACCTCCTGATACAGTTTAGCTCTGTGTCCCCACCTAAGTCTCATG 
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TTGAATTGTAATCCCCAATGCTGGGGGAGGGGCTTTGTGGGAGGTC5ATTG 

AATTGTGGGGGTGCACTTCCCCCTTGCTGTTCTTGAGATAGTGAATGAGC 

TCTCATGAGCTCCCCTTCACTCACTCTCTTTCCTGCTGCCATGTGAGGAT 

GTGCTTGCCTCTTCTTTGCCCTTCTGCCATGATGTGTTTCCTGAGTCCTC 

CCTAACCATGCCTCCTGTACAGCTTGCAGAACTGTGAGTCAGTTAAATCT 

CTTTTCTTCATAAATTACCCAGTCTCAGGTGGCTCTTTATAGCAGTGTGA 

AAAGGAACTAATATACCTCCTAAGTTACCTCAAGCTTCTTCTTAATTCCT 

TCTCCTCCCTTCCTTCATTGCCAAGCAAACAACCACCTGTTTTCTGTCAC 

TATAGATTAGTTTACATTTTGTGGGTTTTTTT TTTT TTTGAGACAAGGTC 

TCACTCTGTTGCCCAGGATGGAGTGCAGTGGTGCGATCATAGCTCATTGC 

AGCCTTGAACTCCTAGTTTCAAGTGGTCCTCCCACTTCAGCCTCCTGAGT 

ACCTGGGACTACAGGGGTACACCACCACAACTGGCTTAAAAAATTTTTTA 

AATAAAAATGGGGTCTTGTTATGTTTCTCAGGCTGGTCTCGAACTCCTCG 

CCTCAAGCAGGCCTCCCTCCTTGGCCTCCCAAATTGTTGGGATTACAGGC 

ATGAGTCATGACTCCTGGCCTAGTTTACATTTTCTAGAGTTTTGTATAAA 

TGGAAACATACAGAATGTATTTTTTTGCGGAGTGGGGGAGTCTTTCTATT 

TCTTTCTTTCTTTTTTCTTTTTTTTTTTTTTTTTTTGAGACGGAGTCTCG 

CTCTGTCTGTTGCCCAGGCTGGAGTGCAGTGGTGCGATCTCGGCTCACCG 

CAAGCTCCACCTCCCGGGTTCAAGCAATTCTCCTGCCTCAGCCTCCTGAG 

TAGCTGGGACTACAGGCGCCCGCCACCACACCTGGCTAATTTTTTTTGTA 

TTTTTGGTAGAGACGGGGTTTCACCATGTTAGCCAGGATGGTCTCdATCT 

CCTGACCTCGTGATCTGCCCGCTTCGGCCTCCCTAAGTGCTGGGATTACA 

GGCGTGAGCCACCGTGCCCGGCCCAAGTGTTTCTATTTCTTAACCAGCTT 

TCATGCAATCTTTTTTTATTTTACCATCTCTGTGATCCCACTCCCAAAGG 

TACTAGATGTCGATTGGTCCTTAGGATCAGCTACCATTTGCCCAACTGCT 

TTCCAGCCTTCCAAAAATTTTTTTCTTTTTTTCTTAAAGATACTCCTGTG 

TGAGGCTCAGAACTCTTGAATTGCTACTGCAAATATGAACTCGGTGATGT 

GAATGCCAGGGAATTGCCTGATTGATCAAAGAAATGTATCCCCTTCTCCC 

TCACTCTTGCTGTCTTCTCATTTGTTTTCCCCATCCTTGTGGATTCGTGA 

ATTTAAATATCCCTTTAATGTTATAATATTTTAATGGCGTTTGGCGAAAA 

GTACAGAATTAGGTGCAAGAGTGCZATAGCTGTTATTTTTTTTTTGGCCTC 

TGAGACTGTTCATATATGCAAGTTATTTAACAGAAAGTTCTGCAGTGACC 

TGAGATGTCAGGGGGGTCTGATAGAGTACGTTTGAAGGCAGTTACTGGAA 

AAAAATAATGCCATTTCTGGTTTGTACTTCGGTAAGTTCAGATGACCCAA 

TATATTGTTTACATGTGGCATTCAGTAAAAAAGTAGCTTCCCCTCCCTTT 

CTTCTTCCTTTTCTCCTTTCCTGCTTCTATAAAGCATCTGCTTTGGGAAA 

CTTCTTAGGAGGAGAGCTTGCCAGCCCGTGGGTAATGGAGAGGTCTTGCA 

GAGATAAAAGAGATGCTCCCACTCAATGCAGGATGGTGTGGAGGTAAATG 

GGGATACGTCTGGCATCACTCAGGAATGGGCCTTCCTGGCAGGGAAGAGA 

AGGGAGGGGAAAGAGGAAGGGAGTCAAAGATGAATTGCTGAATACGGGGA 

TTCCAGGGCCTGGAGCCAGGAAGAGAACTTTGGGAGGTGTGAACCTGGAG 

GGCATCAGCTGATGAGGAGCAGCCTGAAGTCCGGGGAGGACCTGTT7TTG 

GTGGCCAGGAAGAAAGTGCCTTCCACACACAGGGAGGCCACAAGGCTGAT 

GGGCTGGGGGTTGGAAGGACAGCCCTAGGACAGGCTTGGGAAGCAGGCTC 

AGGTAGGGACTGCGAGGTTCTTGTTGAGTCTTTTTCATTCCTGGTCTTAG 

AAAATAGAATCCAAGGCCTCTTGAGAGTGGAAGGTGGGTTGGGAGGAGGG 

CAGATGGGGCTTAGGCCCAGGACACCCGTAGAGCTACTGCCCAGCTGTCT 

CTCAGGGACTCTGCTGAGGTCACTCCAAGGATCATTCTTAGCCTTGCTAG 

ACAGTACTGACAGAGGGAACCGTAGTATCGCACCCACTTCCTTCTCTTTC 

AATGAAAGTTTAAAGGTCACCATTTCCTCTGGCAAAGGAAGTTCCACAAA 

TATTCCATTTCCGGTCTTAGAAACAGCAAGGTATCAAGCAATTGCAAACT 

TCCTGTGCTGGGGAATTCCCAAGGAAGTAGGGGCAGAGTTCTGGTGGAGA 

CAAAGTGAATTCCGAGTGATTAGTCAGTAGCAGTAGCAGTAGCAGTAGCA 

GTAGCAGTAGCAGTAGCAGTAGCAGTAGCAGTAGCAGTAGCAGTAGCAGC 

AGCAGAACCAGAATTTCCCCGCACGTGTCTCAGGCTCTCATTTGCCAACT 

CAGTCTCTAAGTATTTTTATTGGCAGGAAAAATAAAATAGCTATGAGTGA 

AATAATTCATTAGACCTGAGCCTCCATCAATTTTGTGTTTAAAGGCCTGA 

CTCTCTTTACCTTTCCCTGGGATGGAAGATGCAAATGTTCCTGATGTCAC 

TGTCAAAAAAGAAGAACCAGTGGGTATATTGTATGCTTGAGTTCCAGCCA 

TTTGTCACAATAGATAGAGATGACTGCCATGTGTGTAGACTTTCTATAGA 
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CTGTGTGCTAAACCCGAcCTGCCACTTCCAAGGAGTAGATGAGGAkTG A C 
CATGGTTCTGGGGAGCCCTACCCCAATTTGGGGCAGACATTCCAAAGCTC 
ATTTTCTGTGGAGGGGGTTGATGGTTAAAGGACGGCCTGGGAGTAACTCG 
TCTGTACTAGGGCCCAGGAGAGTTACATGCTGCTTCCCATGTTATTCATC 
ATTCCCCCATGTGAATAGCTATGGCGTGAGGTCCAAGGTTAGGGCCTTTC 
TACCATAAATGGGGGAATAAAATTCCCCTACCAGCCTGAGAAGTTTCTGT 
TATAAAGAGGCTTTTTTTTTGCGGGGGTGGGGGAGCAAGCCGACTAATGT 
GTTATTTCCATACGGTTTGTTTTAAAATGTAGATGTCATATGCAGGAGAG 
GTGGTGTAGTGAGTCACAACGGGATTAGAAGGACCAGTCCGAAAAGCAGA 
AGAGGGTCAAGTTCAGGGCACTGAGGACTACTGCATTCAGTGGCGTGAAA 
GGCAGATGGCTGAACAGGAGGGGGACATTACATTGCTTGTTCTCCTTGAG 
CCTCGATTTCCTCATCTAAAAAGAGGGTCATTTATTCACAGAACATTTAT 
TAAACT7GTGCCAGGCACCGTGCCAGGAGCTGGACTAAAAATTAAATCCA 
CCCCTGTGAGCTGCTCTGAAGGCTAAAATATGAAGTATGTAAAAGTAACC 
AAGTGCTGTACACATGCAGCTATTCAATGACTGTGTGGGCATTGCGGCAG 
ATTTTAATTTTCTTTTTTATTTCTTTCTCTTTAGTGAGAGGXGTTGGTTG 
TTATTATTGTCGTCGCTGTAACTGTCTATTTCACTTGCTTTTTTGTTGCC 
TCCAGCCCATTCCAGGGCTGTCATCTAAGACACTTCTTATCACCTAAATA 
ACCGGGGAGGCAAAGCGCTTTCTTAAGAGATGGATCCAGAAGAACAATGC 
TGGTTTTCTGTAGAAAAAGGGGCTGTGGGAAGTAGAGATAAGAAGGGAAT 
TGGCCAAGATGAATGTACAGAGCCTTATTTTTTTTTTATAACACAGCAAG 
ATTAGATACAAAACAGGACAATAGCATCATCTGTTTTTATAACTGGAAAG 
GACCTCACTTTACAGGTGGGGAAGAATAGAGTGGAGAAGTGAAGAGAATG 
GTCACAGAGTCAATCAGCATGTCTGCGTCAAAGCTGGGATTCCCAATTCA 
GGGCTCTTACTACAGTGACGTATGGCTAATATTTTGGCATTGTTTCGGGG 
AAAAGCTGAAGCCCTGATGGTGTACGTCACTCTTGAGATAGTCTGTAGTC 
CAGCAGGGAGGAAAGCAAGGAAGGGAGGTGGAGGCAGCATTTTTGGGTGT 
AACATTTCGTTCTTGTTTTGTGGCCAAATCATAGTGTGATTGGGACAAGC 
CACTGCCTTTCTCTGAGCCTCCACTTTCTTTTTCTTCTTAAGAGGGAGGG 
AATAGTAGAGTAAAAGTAGTCATTTTATCAAACACCTGCTATTTTGGAGC 
CATATTGCAAGTGGGTTGGGGGTTGAACZACTTGGCTTTATTACCCATAGG 
ATTAAATCCAACCTCGATACTGTGGCATTCCCAAACTCCAGTCTAATCTT 
CTTCTCCATCAGCCATGCCCCACGACACCCTGGTCATATCTGATGTTGCC 
CCTTGCACTTGCCCCCTCCTTATCTTTGCTTTCTGACCTACCATATGGCT 
ATTGGTTGAAATTCTCATTTTCCAGGGCCTTGCTTAAATATCATCTCATC 
CATTAAAACTTTCTTGAACCTCCCCTTGCCCTGTTCCTCCCTAATGTCTC 
AAGCCAGAATTTATTTCCTTTTGTGGCCAAGGGACTGGGTTTGTGACCTC 
TCTCACGAGACTTAATATTGAGACCAAACGTCTTTAGACCTCACCAGCCA 
GAGAGATGAGCATCTATGGAATGCAGGCTTTTGCCTGGACTTGCTGATGC 
AGGGCCTCTGCCTTCCTCCAGGGCCTCTCCTGCTGTTTTAGGAATTTCCC 
TCATGGCACAGTCCATGAGCTCAGGGTCAAGTTCATACATGTTTTTACTT 
CTTCTACTCTGCAAATGGTCTTCTTGAACTCTGAGGGTCCTAAAGCTGCT 
CTGCAGTTTGTGGGGTGAGTAGAAAGGGGCTTTCAAAAGTTGTGCTGTTG 
TTT CCCA CCCCAATAGCATGAAACACAAAGATGCTTACAAATAGCTGCCT 
TGCTTTCTAGTCCCAACTTCTCTCTCCTGAGGCTTTAAAACAAGTCCCCT 
AGGTTGAGCTGGACTGGAGTTGTATCCTATCTTCATTATCTGTCTACTCT 
CTTTCTGCTCTCTAGAGAAGATATTATATATGTGTGTATGTATGTGTAAA 
TATATAATATCCATATATAGAACATATATTGTTATATTTACATATACATA 
CATAACATATGCATGTATTCATATATACATATGTAGTATCAAAGTTGGAA 
TTAAACTGTATATTTTGTAATTTGCTTTTATTTGCATCTATCACTGTAAA 
ATGAATATTTATCCATACCGTAAGATATTCTTCAATGTATTTTTTTTTTT 
TTTGAAACAGGGTCTTGCTTTGTTGCCCAGGCTGGAGTGCAATGACCCGA 
TCTTGGGTCACTGCAGCCTTGACCTCCCCGGCTCAAGTGATCTTCCCACC 
TTAGCCCTCTGAGTAGCTGGGACTAAAGGTGTGTGCCTCCACACCCAGCT 
TTTTAATTTTTTTTGTATTTTTTTTTTAAAGACAGGGTTTTGCCACATTG 
CCCAAGCTGGTCTTGAGCTCCTGGGTCCAAGCAATCCTCCCACTTTGGCC 
TCCCAAAGTGCTAAGATTACAAGCATGAGCCACCACACCTGGCCTCAATG 
TAATTTTTAATGGCTGTATAGTATTCCATCATGTGGTTGTACCCAAAATT 
ATTTAAC CAGTC C C CAGTTTATTTCAATTTTTTTTTACTATTTTGAATAA 
TGTTTTAGTAAATACCCACAAAATATGTACAATGGCTGGGCTTAGTGGCT 



FIG • 4 (38 £ 61) 

nli if 



WO 99/06426 



PCT/US98/16102 



CACCCCTGTAATCCCAA^CTTTGGGAGTCTGAGGCAGGTGGGT^cdrG 

AGG7CAGGAGTTCGAGACCATCTTGGTTAACATGGTGAAACCCCGTCTCT 

ACCAAAAA7ACAAAAATTAGCCGGG7G7GG7GGCACACACC7G7AA7CGC 

AGCTACTTGGGAGGCTGAAGTAGGAAAATCACTTGAACCTAGGAGGCGGA 

GG7TGCAGTGAGCCGAGATCACACTACTGTACTCCAGCATGGGCAACAGT 

GAGACTCCATCTCAAAAAAAAAAAAAAAAAAAAAAAAGTACAATTTGTTG 

TACCTCCCTGATTATTTCTTTTAAGTAGAATTTTCTTATAATTTTTTTTA 

TAAGTAAAATTTTGAATCAAGGGAGAAGCACCTGGAGTCCTTCAGATACC 

TATTGCCAAACTGAACTTTTCTGTTCCAGGTTTACTACATTCAGCCTGAC 

TCAGGGTTTGGGGAGTAGAGGAGGGGGTGGAGGCAGAGGGCCTCTCCCTG 

TCCCCACAGACCTCCCTTGGTGAGGTCCAAGTCTGGACAGGTGGAGTGTG 

GCATTGCACCGTCAGGTCCTGCTTCCTGTAATTCCCCTAAATCCATCCAG 

TGGAGCCTCATTGTTCAAGTCTTTTTTTTTTTTTTTTTTTTTTAACTCCC 

CTGAAGACGGAGTCTCACTCTGTCGCCCAGGCTGGAGTGCAGTGGCACGA 

TCTTGACTCATTTCAACCTCTGCCTCCCAGGTTCAAGTAATTCTCCTGCC 

TCAGCCTCCTGAGTAGCTGGCACTACAGGCGTGTACCATCACGCCCGGCT 

AATTTTTTTTTGTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAG 

GCTGGTCTCGAACTCCTAACCTTGTGATCTACCCGCCTCTGCCTCCCAAA 

GTGCTGGGCTTACAGGTGTGAGCCACCAGGCCTGGCCTCAAGTCTATTTT 

TTAACTCCAGGAGGCCTGGTATTCAGAGGGATTAGGGCTGGCAGAAGGGC 

CTCAAAGCTTTCAAGGCCTGGGGAATAGGCTGCAGCCTGGTTCAGGGTAA 

CCCAAGTGATTTTGGTTCCAAAGGGACAGGAAAAAAAGTGATTGATATGG 

AAGTTGTCAAAGTGCAACTGTCAAGACATTAAAAAATGTAACCCTTTTAC 

TAATATACAGTAGACTTGTGTTAAATATTTAACTGATTGTAAAAGGAAAA 

AACCAGACGCAGTTTTCCCTACCATACTGTCACAACACCTCAACACTGAG 

TTCTTCTGTGACCTCTAGTCACCGAAATGCTTGGGGATTTCTCCCACCAC 

TAGTCCTCCAGCAGCCGACACCAGTTGGGTGTCCTAATTCACTCCAACAC 

TATCTACCTGGAGTTAGCGTTAGATCCCACAGGTTGAGGGCTCAGTCTCA 

CAAGACTGCCTCCCACTTCAGGTGCCAGTTACAAGTGGTAGGTTGTCACC 

TATGCTTCTGACTGATGGCTATAAATCTGGGTTTGCTTCCCTCGGGTTCC 

GTGAATTTGCTAGAGCAGCTCACAGAACTCAGGAAAACACTTAAGTTTAC 

CAGTTTATTCTAAAAGATATTACAAAGGATACAGATGAACACCAGATGAA 

GAGATGCGCAGAGCAAAGCATGTGAGAAGGGGTGTGGAGCTTCCATGCCC 

CTCTGGGGCACCACCCTCCAGGAACCTTCATGTGTCCAGCTATCTGGGAG 

CCCTTCCAAACCCTGTCCTTTTTGGGTTTTTAAGAGTGGCTTTATTACAT 

ACACATGATTGACCGAACCATTGGCCATTGGTGACTGACACAACCTTCAG 

CCCCTCCACTCCCTCCAGTGGTTGGGGAGTGGGGCTAACAGTC7CAAGTC 

TCCAATCCTGCCTTGGTCTTTCCTGTGACAAACCCCATCATGAAGCTACT 

GCATTGGGGCTGCCAGCCAGCAGTCATCTATTAGCATGCAAAAGACACTC 

7TATTATTCCAGAGATTCCAAGGGTTTTTAAAAGCTGTATGTCAGGAAAC 

AGGAGATGAAGAACAAATATATATTTCACAACZATCACACTCGTT 

ATTGACAGGATAGCAAAACTGATTAAAGGAGGATAGGAGAGACTGAGATA 

TATATTTCCATATATATATATAGAGAGAGAGAGAGATATTTCCATATATA 

TATATAGATCTAGAGAGAGAGAGAGATAGAGAGAGAAGAGTCTTTCC 

>Contig51 

ACACATTTGGGGGAGCAGTTCCGGAGGTACAGCCCGGACAGGAGATGTGA 
GAAGATCGTGGTTANTGTTCCCCTGGTCCAGAACCCCTCCAAGTGGGCTT 
AAGTAGGAAGGGTGGTGAGCGGCAGGTAAACACACGTCAAAGGCAGTCTT 
CCTCTCTGAGGGAAAACACTTGTATAAGCATTGCAATCAATGGGCCTCTT 
TAATTATGTGCCAGTGGCAAGAGCGGGTGCTGAACCCAGGGGCCTGCCTC 
AATCCGGGGCCTTTGAGGCAGAATAAAGTGGTCTCAGGTTGTTGGCATTT 
CCTTGCCC7TCCACCCGAAGCAGACACAAATCCTCTCTGGAGGCAAGTTC 
CCCAATTCAGCCAGTACAACTCCCACAGACTAAGATCAATCATGTACAAG 
CTCACAGACAAAGGTCACCAAACACACAGAGCAATAAACAAATTCATGAG 
TGACGTGAATGAGAATAAACAGAAACAATAACCACCAGCTGGGATGCTCT 
AAGTCTTCAGCTGTTAGAATTCCTGAATATAGAATAAAACTGCCACAATG 
GCAAACATGCATCTAGTACTTACTGTGTGCTGGGTTCTAAGAATTTTGCA 
CATTGTGCCAGATACCGACTCAGCTTCACACTCACCCTCCTACTGTGCCC 
7CT7AATTTGCAC7AGATTAAAAGGTAGAAAGGAAGAGGCAGCTATTCTG 
T7C77GGC7G7GCC7C7GGCAGCACA7GCAAAA7GGGCAG7AACAG7GGC 
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AGTCACAGGTAAGTAGC ITCTCACAGx ^ rGGAGTTAAAGGCATGGGAv. J 
GAGACGAGCAAGGTTCCTAAAGGGACAGTGGCCAGTAAATGACCAGGGGC 
TACTGGAGTGGCTGCATGGCTCTGTGGAAGCTCAGAGGAGCCTTGGGTCC 
TGCAGGTGCAGTAGCAGCTTTCTGTAGTTCCTGATCTCTGGGTCCCACAA 
TCTTCCCCGTTTTTGCTCCTCCACTTCTAATTTTGTAACTGACTTCCCTG 
TGTGTACTTCTCTCTCTGATTGAAATAGCCAGACTGGTTTCTGTTTCCTG 
ATAAGACATTGTCTGGTACGAACACAGTAACTCATTTAATCCGATATCTC 
TATGAAGGAGGTACAATAATTATTCCTATTTTACAGATGAGGAAACACAG 
CAGAAAAATAAAGTCAATTGTCTAAGGTTGCACATTTAGTCAAGGGAAGG 
GTTGATATAACATATAATTATTTAGAAAACATCTAAGGAAATAAAAGGCA 
TAATTTAAAAATAAAACTAGGCAGGTTTAAAAAAATGAAGTAATCTATAA 
GTAAAAAAGTATAATTGTTGAAATACATATCTTAGTGGATGGGTTAAATA 
GCTGAAGAAATGATTAATGAACTGGAAGGTAGTTCTGAGGAAATCAGAAT 
T CAGC ATAGAT AGAAAAAATGGGAATTTACAAAAGTACACAGGAATTATA 
AAAGAGGTTAAATTATAGGGAGGGTAGAATGAGAATTAACATTGGTCTAA 
CTGGAATTTTGGAAGAAGAGAATAGAGAGAATGAACAAGGCAATATTTAA 
AGAGG7GGCTGAGAATTTTTCAGAACCAACACAAACTATGACTTTACCAG 
TAGAGAAAACAATGTACACTGAGGAGGATAAATAAATATACTATGAACAA 
ATTGTAATAATAATACTCAACAAAGACAAAGAGAAGATCTTAAAATCAGC 
AAAAAAAGAAAGTCAGACTTAGAAAGAAATGACAATGGCAGACTACTCAA 
CAACAACAATGGAAACCAAATTCAGTGAAACAGTATTTTCAAAATGCATA 
TTTAATCTATCTTTGAAGAATAAGGGTGAAAAGGGTGAAAATTGCTGCCT 
TATACAAAATATCAACATTAACAAAAAGTAATGAAGGTAATATAAAAATG 
TTTTCAAATAAACAAAACTGAGAGAGTTTACCACCAACAAGCATTCATTA 
AATGGACTTTTAAATGCAGTTTTTAGGAAGAAGGAAAACAATTCCTAAGG 
AAGGTCTGAGATGCAAAAAGGAATTATGAACAAAGAAATTGTTAAAATTA 
TAGGTGAATTAAAAAAACTGCCTGCATAAATGATAATAATGACAATGATG 
CTATTAATAATGAGTTGATAAGGATAAAGAAAAGGACAGAATTAAAATAC 
TAGAAAACAAGCATGCTGGAAAGGATTCAGGAATTACTTGAAGGTTAAAG 
TTCTAGGGTCCTTCTATCCTTCTAGAGGGGAGTCAATATATTAATTTTTG 
ACCGTCACTTACACAGTGAAAAACTTTAAGGATAACCATAAAAAAATAGA 
AATAGAGAGTATAACTTCTGAAACAGTCAAGGGAAAAATATGGAATAAGA 
AAACTGACCAAAAAACATCTCAGTCAATCAAAAAAAAAAAAAAAAAGAAA 
GAAAAGGTTCGGAAGGAGAAAATCAAAGCATAGAAAAAGCGGGACAAATA 
GAAGTGGAAAAGAAAAAGGTAGAAGAAACAGGTCCAGAAATATCACTGAT 
GCACTAAATCACCATTAAAAGATGAAAACAAATGAACAACATCAAAAAAT 
TCTAGTGACTGTAGTAGTGCTGATCAGAATAGGCTCTAAGATAAGATGCA 
TTATTGTGAGTCAACTTGTGATGATGAAAGGTTTAATTCACCAGAAAGAC 
ACAATTATAAACTTGTAATCAAATAGTTTTATTTTATTTACTTTATTTAT 
7TATTTTTTTTGAGACAGGATCTTGTTCTGTTGCTCAGGCTGGAGTGCAG 
TGGCTTGATCTCAGCTCACTGCAGCCTCCACCTCTTGAGGCTCAAGCTTT 
CTTCCTGCCTTAGCCTCATGAGTAGCTGGGTCCACAGGCACACACCACCA 
AGCCCTGCTAATTTTTGTATTTTTTGTAGAGATGGGGTTTCACCATGTTA 
CCAGGCTGGTCTCAAACTCCTGGGCTCAAGCGATCTGCCCCCCTCGGCTT 
CCCAAAGTGTTGGGATTATAGGCGTGAGCCACGGTGCCTGGCCTCAAATA 
ACTATTTAAGTGAAACAAAACTAGTATGGCACTAATGAAAAATGTATAAA 
TCCATAATCGCAGAGGGATTTCAACTTACTTCTTTCGATTATGTAAAGGT 
CAAACAGACAAAAGACAATGACAAAACTTAATGCAATGAACACTTTTGAT 
TTAATGAACATATATTGGATATGTACCCAAGAATTAGAGAATACATACTA 
GTTTTGAGTTTATGCAGAACATTTACAAAAATTTAGTGGAAGCCTAAATT 
ATAAAAAGTTGCTGTCACGTAGAATAACACACAAACCCCTGAGTCCGGAA 
TTCAAAGCCCTCCACACTCTCCTCTACCTTTGCATCTTTATCCTCCACCA 
CACTGCAGTGCATACTCTGGGCTACTACTCACTGTTCTTGATTCAAATTC 
CATGTTCTGTCAGCTCAAATCATTCTCTCTGCCTGGAATAACTACTTCAT 
ACATATTCTGCTATTGAATTCTTGTCTTAGCACCCCATCTACTCCAAGAC 
GATGTCCAGTTGGGGTTACTCCCTGTCCCATTTTCTTTGATTACACTTTT 
TTTTTCTACTTCCATTATATTATTGATCACATCTGTGCCACAGTTTTTGA 
CTTTGTGTCTGCTTTTACTCTTTTCTAGACCCTGAGAGCTCCTGAAGGGT 
TGGGTCATTTCTTTTTTATTTGCTCATTCCTCATGGCACAGTGAGTGCTT 
AATAAATGGCTATTGACTGAAATTAAACTGTATCTAAATGGACATATTCC 
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ACTTCTGGGCCATTCATi'CTTTCTTTCTATTGGAACCAGGAGATGfe^AA 
CCATAACAAAGGTAAGGTTGTGCCATGTGAAAGAACATGGAACCTTCCCC 
TGAGGGCCAAAAAAGAGCAGGGAAAGGTGCAAAGACAAAATCTTCCATTT 
TTAAACAATGTAAGAATGTGGTCCACCTCATGCTCAGGTGGGACTTTATC 
ATGACGTTATTTTTGGGGACTTATAGCTGCATCATTTACCCCATATACAT 
TTACCTTTAGTGTAGGGAACTGAGGACAGGAATTTTGTTGATGCAGACTC 
TXGCTAATGAGGCTAACACTTGGAGAATTTTTATCATGCATTCAAGAAGC 
TTGTTTTACATTTCTTCATTAATACTTTAGTTGGTGGTTTAGCTTTAGTT 
GTAGGCTTATCAGATATTTGGAGATATCTTCATAAACGATGGCTTTGGTT 
TTAGAAGAGTTATTCTGAAGCTACTATTTCTGGCAATAATCAAACAGCAT 
GGCCATTTGTTTTGTAAGGCCTTTCCTAAAATATGACGGTAAAATCTACG 
TGTGGAAAAATGCTTATTCTTCTGTCCTCTATAAATGTGAATCTAGTTTG 
TCTTCAAAATGAAATCAAGTGATTAAAATGTAGTTTTCTAAGAAGATAAA 
TGGAGCAAAGCACTCTGTGTTTCACAGTGTTGGAAATCACTCATCCCTCA 
TAAAACTGTCCCAACTGATCCTGACTCACATGAATGAATTAAAATAAGAG 
TTAATAACATCAATTTACATTTTTAAAGACACTTTCCCATGXTTTAGACT 
ATTGGTTGGAAAAGCTGGTAGGTGTACAATTTGTGGAGAGTTGGCTGTTT 
TTGTCTGTCGTTGTTTGACGTATTTCAAAGCCATATCTAATTTTGTTGCA 
GAATGGTCTGAATTCTACAAAAATGTTGAGTTGTGTAGTGTGGAGAAGTA 
CGGAGCCATTTACTGAAAGGCTGGGGGGAAATGACGAGACCCTGAGATAA 
GGCAGTAGTGGTGCGAACAGAGTGGAAGGGAGGTAGTTGAGATATGTTCA 
GAGTAGAATCAGAATGGACATAGTGAACAACTGGATGCAGGTGGGGGCTG 
AGGAAGCAAAGTTGAGGATAATTCTGAGACTTCTAGGTTGATCCACTGAA 
GTTACATTATTCAACACCACAAGGAAACTAGGGGAATGAGAAGGCATACT 
GGTTTGCTTTGGAGTGGAAGGGCAGTGATGTAAGAGGAGTTAATGAGTTA 
AAGTTTGGATATGCCTGAACTTCAATTTGATATGTGCATCTGATATACCC 
TTGGGGTGACCCTCCAGGCAATGGTTGAACATGTGTATTTCTTAGTAACT 
GATAGGCATCACAGACTCACATCAGTAAGGAAGCAACAGCAAACTTGATT 
GGACGATATACCTGGAACTCAGTACCCTATGACTGGAGCAAGTCTCTGTC 
AGTGAAATGAGGATAAGAAGAATCTTGACCTTGTGGAATATGTTGTTAGG 
AATATATGTGATGAACAACATAGGATACTTCCTACAGGGCTCCACATGTA 
GTAAGGGCTTTATAAATGCTTGATAAATATTATTGTTGTAATTTATTTCC 
AAAGTAAGATGCCACTGGAGGAATCTTTGGAACCCAAATTAATAACAAAT 
AGGACTGGATGCAATGGCTCACACCTGTAATCCCAGCACTTTGGAAGGCC 
AAGGCAGGAGGATCTCTTGAGCCCAGAAATTCAAGACCAGCCTGGGTGAC 
ACAGGGAGACCTTGTATCTATGAAGAATTAAAAAAAATTAACCAGATGTG 
GTGGTGCACGCCTATAGTCCCTGCTGCTTGAGAGGCTGAGGTGGGAGGAT 
TGCTTGAGCCCATGAGGTTGAGGCTGCAGTGAGCCATAATTGTGCCACCA 
CACTCCAGACTGGGTGACAGAGTGAGACCCTATCTCAAATAAATAAATAA 
ATAAATAAATAAATAAGTACZAAACCAGCAAACACTAATCCTTTCTAGAGA 
TTATTGAACTCTGGAGGGCAGATCTGAATGGAGCCAGCAGAGGGACCTAT 
GGAGATCAGCCTGGCCCTGGACAGCACCAGGCAATGGGGTTGCTAGAGAG 
GTAATGGGGTTGAACAGGGTTTAAGCCATGAGGTCTCAAGAATCCGTGAA 
GACTCAGACTAATTTTTTTTTTTTTGCATGAGGATTAGGTGTTCCTAGGA 
ATTTCAATGAGAGCAGGGTTAATGAAGGAATGCAGGGTAGGAGAGCTGAG 
GGAAGGCATCTGAGAGAGCCTGGCTTATGAATGGCTGCGTCAGTATGGCT 
CACCTGCTTTCCTTGTATCTACTTAGCAGATGATCCCACCCCAGGCCTCC 
AGGGCCAAGGTCATTTCCACATAGTCATGGGCCCTTGAGGGCCTGGAGCA 
GTGTAAGGAAGACAGAGTCTTAAGAAATTGCATTAACAGTCATGGTGCTT 
GGCAAGTGTCGTCATCCTATGCCAAGCCTGATCTGAAGGGGTGCATGCTC 
ATAGGTAGCTGCTGCCCAAGATTACAGCAGCTTCTTCAATCCCAGATCCA 
TGCTCTCCTATATTCATTTTTCCAGGGGTTCCTGTCCTTCGACAGTGATG 
AGATGCAGAATGACTTATTGAGTTATTCTCCTGATAGTTGCCAACTTTTC 
CAAATGACAATGGGGCATGGAGCTTGAGAGTGGAAATGAGGCCCTAGGGA 
TAGCGTGCTTAGGAAAACACTCCCAGCCTGATGTAATTCTGGGGGTACAA 
TGGCATTTTCATCATCAAGACTGATGTAAAGGGTGACTAGCAGTGAGTTG 
GGGGTGACTCGCACTGGGGCTAGGTTTCTGATTCTGCCTAATCCAGACAG 
AGCAGAAGCACTAGTGGGCTGGTAGAGGGCCTCCAGGGCC7CACTTAATG 
TCCTGGAAAAACAGCTCCAGATTGTTGGTTCACGTTCTGAGGACAAGCTT 
GGGTACTACAGGATAGAGAGAGTGGTGGGAGATGCCGTGGCCTGCCCTGC 
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TGATGCCTGCCCTGCCATTCCTGCGTGTGATGTCTCTGGGGCATCTteCC 

TTCCCTGCCCAGACCTGTAGTTCAGCTGAGGGCATGTGGAGGCCAAATGG 

CTTCTTAGAGTGTTACTTTCCTTGAACAGCTCTGCTGGGAGAACTGGAGG 

AGCTAGCTAGTCACGGTAACTGCAGCAGTCAAAGGATCGTCCCGGTGGAG 

GTGGGGTGGAAAGGTAGAGAAAGAGAACATATAGCGTTTTCCTTGGAGAT 

GTGTGGGCATGTCATAGAGGAAATACCCAATTCCTGAGCCTTGAGCCCTC 

CAGGAAACCTTGGAATATTAGGTTAGTCATCCCCAAGGAAGTCTAAGAAT 

TCTGGTCTCACCCATCTCCTTTAATTCCCACAATGATCCTACATGATATT 

AAGGAACACGGGCCAGTAACCCTCCAAGCAATGGATGTGGTGGTGAAGTT 

TGACCTCATGATGGAGCGGAGGTTGGTTTGAAACCTAAGAATTTAATTTA 

TTGTTTCAAACTGTTCTCCACTCAGCGTTATTAAAGCATACATAATTGAC 

ACATAAAAATTGTATATGTCTACGGTGTACAATGTGATGTTTCGATCTAT 

GTATACATTGTGAAATGATTACAACAAGCTAAATAACATACCCATTCATC 

GTGTTTCAAAGGAATTAAACTCAAGCACAAAAGAGAGGTGCTGTTGAAGA 

GTAGGGCTGCTCTATCTAAGTAGTATGTCTGGGGTTGTCCTGGATCAGGG 

TCCTTTTGTGCTAGTAATAAACCAGCCCTTCTGGGGCTGCTC€ACTTTCC 

CCACATTTTCTTCTGGAGCCTCCCTAAGAATTAGGACATGGCCACTTTCT 

CTGCATAGGCTTCCTACTTCAACAAGGACAGGGCTTGTGCTGCCCCATGC 

CACTTGAGTGTCCCTACAGCACAGAGCTGAGTGCACACTGGCTGAGTGAG 

GAAATCCCCCAGATTAATCTTGGTTCTAAGCATCATGGCTGTATTTCACA 

CGTATATGAATTACAAATTACAGCATAGTCGAATAAGGATTTTTGfGCTA 

CAACTGGAATCCCAGATTATGCAAATTGGATAGTATAATATTGAAATTCC 

TAGGACTTTTTATTAGTTTTAAAAAATTATACAAGCTTAGAGTAAGAAAT 

TAAACAGTGCAAAAGAATTCACTGTGAAAAGTAAAATGCTCTGTCTGTGC 

TGAGAGACAGATATTGCAGCCCAGATACTACTGGGGTCAATAGTTTCCTT 

TAAGCATGCCATTTTGATGGTTTATGGGACTTACAGCTCAAGAAGCTTGA 

CACTAGGGTTGATCTCAGAAAATCATTGTTGCAGGTATTAGATATGACCG 

TCTCATAAAGATACACACACAGACACAGCGATTGGAGATATTCACTGGGG 

CTTATGGGCTGCTTGTCCTTTCTGCTCTGTGCCTAAGTTGGGCTCAGAGT 

AGCCTGGCATCGGCTGTGGGGAGAATGCTGGCATGGGGTTAGCAGGAGCC 

CACTTAACATGTCCTAAGCCACCTGGAAGAGTCCTTCAAGGAGACCAGAC 

TCCAGAGGC CCTA AGGAAGGAAGGACTTTTGCCCGTTTTTAGGTATTCTA 

GTCCCAGAGTTTAGGGAGGAATGGTTTGGCTTTGGGTCGTGTGCCCCTTT 

ACCGAGTGGGATGGGATGTGCCCATGAGCTGTTGAGCTGGCTCTTGGAGA 

AGACAGCAAAAGCGGGAATAAGAGGTCAGGAAGCTGTGTGGTTGTAGGAA 

ATCCCAGCAGAGGGCCTGGGGGTCAAAAGTGGTCATGGTAGTGACGGTGG 

AGGCTGAGGTGGTAGAAAATCAGAGGACAAACCCCATGGGCTGCTGGTGA 

TCTGACCGAGCTCCTATGCTCTCCTGGTTCATTTTAGGCTCTGTAGCAGC 

AGATGATTGGCTGGTGTGAGAGCAGTGCACCTGCCATATCAGGCAATCCA 

AGACAAGTCCAAGCTACGCTGGGAGGAAACCTGAAGGCAGCAGCAGGTAG 

ACTGGCTGAAGACAGACAGGCAGGCAACTTGTCAATCAGATTTGTGTTTT 

TAAGGACTTTTAACTGGGGAGCCCTCCATGACAGATCAGATGAGAGAGGA 

ATCTGGGTCCGCCCATGTGTCAAGCTACCAGAGGGTCCCATCGGTGCTTG 

GATCTTCTTTGAAGCTGGGTCTGAGGTTTGCAGGTAGAGGGTGAGCTGGT 

CAGAGGGACCTATTGCAGAGCTAACCAACACCTTCCCAGGAATGCAAGCA 

CAAGCACCCCACCGCGGGCAGGCGGGCAGGCACTTCTCCTTTTGCCACCA 

GGACCTCACAGAGGCTGATCTGGCTCTGTGAGGTGGGAAAATGGGTTGTA 

CTTAGTACATAGAGATAAAAGGCTTAGGAGGCCCCTCCATCCTGTGACCC 

TGTCCCCAGACCACAGGTGCCGGCAGGTGCTGCTATTTCAAGGCTGGGCC 

TCAGTGCAAGCTTGTGGTTTCTTGCCCACCTGTGATGTCCTCCCACTAAT 

GAAGGGGCTCTCCATCCTCTGTCTGCCTCTAGCAAGTGGAGGCTCTGGGC 

CCTGGGCAAGACACAGGGGGAAATGCCATCTGTTATCCAAATATATTTCA 

ATGTGACAGGAAAGCTGTCTTTAGAGCACAGC 

>Concig52 

GCATGTGCTCTACATTGATCCCAGGAGTTTGAGACAACATTGCAAGACTG 
GGCAACAAGCAAGACTCTGTCTCTACAAAAAATAAAAAAAATTAGTTGGG 
CATGGTGGTACATGCCTGTGGTCCCAGCTACTCCTAAGTTGAAGAGGGAG 
AATTGCTTGAGGCCAGGAGTTCAAGGCTGCAGTGAGCTATGATCACACCA 
CTGCACTCTANCCTGGGTGACAGAGCAAGACCCTGTCTCTAAAATAATAA 
TCGTAATACATTTTTTTTAAAGTAAAACAAAAAAAGGTCACACTTTCTCA 
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TACC^AAATAAATTCCAAATAAATTAAAGGCTTAAACATGAGAAAGTTAA 
ACCATAAAATTACTAGAAGAAAATAAAAGCAAATATTTAGATAATCCTGG 
GGATAAATTTCTTTGGAATGAATTTCCTTAAGATGAATCTCTAAAAGTGA 
AATTCAGGGTTCAAAGGTCTTTTCTTTGTCCTTTTCTTTTCCCTTTCCCT 
CTCCCTTTTTCTTTCTTTCTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCT 
TTCTTTCTTTATCTTTCTTCCTTCCTTCCTTCCTTCCTTCCTTCCTTTCC 
TGGTTGCTTGCTTTCTTTCTTTCCTTCCTTCCTTCTTTCTCTCCCTTTCT 
TTCTCTTTCTTTCTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTT 
CTTTCTTTCTTTCTACTTTCTTTCTTCTCTTTCCTTCCTTCCATCTTTCT 
TTCTT7CTTTCTTTCTTTTCTTTCTCTCTTTCTCTCTCTCTCTCTCTCTT 
TCTTTTTTTTTCTGGTGAGACAGGGTCTCATTCTGTCACTCAGACTGGAG 
AACAG7CGCATGAACATGGCTCACAGCAGCCTTGACCTCCTGGGTTCAAG 
CAATTCTCCTGCCTCAGTCTCTCAAGTAGCTGAGACCACAGGCACCCACC 
ACCAAACCTGGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCAC 
ATTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCTGCCTGCCTT 
GGCCTTCTGAAGTGCTGGGATTACAGGCTGGGCCTCTACGCGCGGCCGAG 
ACTACCTCTCTTTTAACTGGATCTCTGAGCTCTGGGCAGAGCCCACCCTG 
AATCCTGGTCTCCAAAAAGGGAAAATTATTAGGAGGCTAGACCATATGAT 
GCTTTTACAGTGCACTTAAAAAAAAGTTTGTTTTTTTTTTAAAAGACATT 
TCTACATGTCTAAACTACAATCTTCCTTGAAAACCCAAGAGTAGCTTCTG 
TTGCAATAGCTAGTCAAAAATATAATAGTCAAAAAAATCAGGTAACACAA 
CACAAACGCAAGCAGTTTAAGAGCTGAAATGAACTTGTCTGTTTACACTC 
TAGGGATTCCATAAGGAAAAATAGAAGTTTCTCCCTAAAAGGGAGCCTGG 
CACCTTCTCCATTTTCTTTAAGGAACCCCAGGCTATTATAAACTATTTTA 
GGGCTCTCATGCAGCAGACGGTGCAAGAGAAAGGAGAGACAGCAGAAGTA 
AATGAAGAAAACAGAATCCAGTCAACAGAGAAGAAAAAAACTTTTGCTCA 
AAAAAAGGCAAGTTCCTAGGAAAGAAAAAAAAAACATGAGGGCTATTTAA 
ATACAAAGACGCATACATACACATGC^CACATCTTGGATGTTAGCTTTTA 
ATTAAGCTGACTTTTAACTATTGAGGTCCTTTAAAATAAATCTTTTAAAA 
TCTTATTACGATATTTCAGCTAGGACAAATTGCTGCTATTTCAGCATTAC 
CAAGTATCAAACCAGAAAAGGCTTGATTTAGGAACCAAACCCAGGCTGTC 
GTGGTAGGAAAAAAGGCAGAACGTTAGCTATGGAACCCACAGCATGGGGC 
AACAGCCATTGCTCTTTCAGTATGGCCTGGCTAGCAAAAAGGTGGCCTTG 
TTATGTAAATAAAGCCCGTTTGGTGGTCAAAATGAAACATCTTTTCCTTT 
TTTTTTTTCTTTTGCTGGCCGTTTTTTCCCCCACCATACCACGTTTGTGT 
GTGTGGGAGGGTGGGAATTTAGCCACTTCAGAGGCCTCATTCCCCATAAT 
TTGGAAATTTCCTTTGGATTTGATCAAGTCAGATAGAGTAGGTCAAACCC 
AATGGGAAAAAGACTGAAACAGCAATAAAAACAGAAACAAACAGTTAAGC 
AAAATGAATGATCACACAACTTATATGATTACTGAGTGCTCTAATGGTAA 
GGAGAAATTAAGACCAGCTGGTTGTTAAACTTTAGCCAAGACAAAACCCC 
AATTCAGCTACTTACCTAGGGTTGGGTCTCAGGCTGAAGACCGCTCACTA 
CCGTTCTAGAAGCAAGAAATAAAACTTGAACTCGTCTTACCTGTGTAGCA 
GGACAAGCCGCAGACAAAATCCCTCAGACACCAAATTAAAGAAGGAAGGG 
CTTTATTGGGCCTGGAGCTGCGGCAAGACTCACGTCTCCAACAACCGAGC 
TCCCCGAGTGTGCAATTCCTGTCCCTTTTAAGGGCTCACAACTCTAAGGC 
GGTCCACATGAGAGAGTCGTGATAGATTGAGCAAGCAGGGGGTATGTGAC 
TGGGGGCTGCATGCACCTGTAGTTAGAATGGAACAGAACATGACAGGGAT 
CTTCACAGTGCTTTTCTTATGCAAATAACCGATTAGATCAGGGGTCGATC 
TTTACCAGGCCCAGGGTGTGTCACCGGGCTGTCTGCTTGTGGATTTCATT 
TCTGCCTTTTAGTTATTACTTCTTTCTTTGGAGGCAGAAATTGGGCATAA 
GACAATATGAGGGGTGGTCTCCTCTCTTACCTGCGGGGAGTGAGCTCAAA 
CTCCTTAAAGGAGTTACCTGCCTTCCATCATCAGGGAAGCAGGAAATCTT 
GCCTTCCTTGTTGGAAGCAAGTAAAACTCAAAACAAACAAAGAAAAAAAC 
AGGGAGTTGTACAGCAAAATAAACTTTTGATTTTGACCAAATTTTGGGAG 
ATCAGGAATTCTCTGAAGGAGATGCTTTCAGACCTCAGCAAATTGTCCTG 
TTGGTTTGAGCCATAAAGTTAGCTCATGCTGGTACCAAACACCAGTAGGA 
GATTTGTCAAAGGTAAGAGGCATCTCCACTCAGAATCCCTTCGTGGTTAC 
CAACATGTGAACCTTGGAAATCTGAGACAGGTCTCAGTTAATTTAGAAAG 
TTTATTTTGCCACGGTTGAGGACACCCACCCATGACAGAGCATCAGGAGG 
TCCTGACCACATGTGCTCAGGGTGGTCTGAGCACAGCTTGGTTTTACACA 
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TTTTAGGGAGACATGAGACATC^GTGAATATATGTAAGATGTAmcfiSGT 
TCCCTCCAGAAAGGCAGAACAACTTGAAGCAGGGAGGGAGCTTCCAGGTC 
ACAGGTAGGTGAGAGACAAACAATTGCATTCTTCTGAGTGTCTGATTAGC 
CTTTCCAAAGGAGGCAATCAGATATGCATTTATCACAGTGAGCAGAGGGG 
TGACTTTGAATAGAATGGGAGGCAGGTTTGCCCTAAGCAGTTCCCAGCTT 
GACTTTTCCCTTTAGCTTAGTGATTTGGAGGCCCCAAGATTTATTTTCCT 
TCTACATCACTGTGGGCAGCTGACTAGGAAAGCT7TGTAGGACTGGTGGG 
CAGTGTGAGAGCCCAGTGGGGGGTGGTGGTCCTGTGCCAATGGTAGCAAC 
CACCTGTGAGGCTGAGTAAACTCATTTCCCAACCTCCTCTAGCAGCCCCA 
GTGGAGATACAGATGAAGCAGACTAGCGATACAACCCAGCCTGAAGTTTT 
GTCTGGTGAGTGTAATGGAATAAAAATGGGAAGGGTGCTGAAGAGACCAG 
CAAGAAAATGGTTGAAGAGATGGGGCACAGAAATTAAGCTGGATCAAAAA 
GGACGGAAAAGCAGAAAGGGCCGATAGAGAGAGGGGATATCTATGGGTTC 
GCGATTCTGAAAAGGACAAATCACTGGTGCTTTGAGAAGAGAGAGGGTGA 
GAAAGCAGGAAGGCTGGAGGCTGTCATCCAAGAGGCGGACATCTGTGAAC 
ATGATTCCAAGAGTCACCAGACCATGGGGGTGGCCAAAGGGAGTGCCTCT 
TCTCACCTCCTACTCTTAATTCCTTGTACTCAAGATAATAAGTTCCCAGA 
AGAGAAGTACCCATATTTAATTCATCTGTGTCTTCCTAGCAGTACTAAAA 
ATATTATATGAAAGGTATCAAACCTTTGAGAATGTGTGCTGCTAAATTGT 
TAAGGATGCTGGAAAACTCAAGACGTCCCTGATCCTGAGCCTGAGTATGA 
GCCTGTGGTGAGCCCAATGCAGGTCTCCATTCAGACAAAGGCCTCAGGGA 
ACGGATGAGACCTAGGGACAGAGATGCATGCTGGAGCAGCATTCCCCATC 
CCTACTGCAGCTCAGGCCAGCTGACTGCTTTATGAGTAAACGTTACCAGG 
GAACACTTTGCAGTCTTAACACACATGCCCACCTGTGACCACTGATCCCT 
GTTGGGTGACCACTGACATCAGAGATTCGATGGCAGCAATGAAGACAAGG 
CTATCCTCATTAGGAAGGAAAGGAAGGAGGAGGGAGGAGGGCAAACGAAT 
CTTTCCTGCTTGTCAACCACGTCCATCTCTGTTAGGTGATTTCCCATGTG 
TGACTTTGTTTATCTTTATAATAACTCTGAGAGGTAGGTCTTGATGTCCA 
CATTTTGAACATGAGGACATCCAGCCAGGAAGTTGAGTTCTGGGGACATA 
GCTGAGAGGGCAAAGCTACATATAAACCCCTCTTTGTTTTTTCTGGCTTA 
TCCACTGAGTGCCCCCTGCAATCCACCAGCCCATTTGTGAAGTGCATACT 
ATAGGTAAGTTGGCACAGGAGGAGTGGATGTGGGCGATTTTGTCACAGCT 
CTCCAGGAACTTACACACTGGTGAGGAGGGCCAGGTATGTTCCTGACCAG 
TCACAATCAAAGCAACCTCCTACTAATCAGGGAGGCTTGGTACCTGGGGA 
ATGCTATGTTGAAAGGTTCTTTTCTGGGTTTTAAAATGATGGGTCTATTT 
CCTTATTCTTAAGATTGCTTTTTTTCTGGCTAGAACTTAAAAGAAATTTT 
CAGTAAAATTTCCCTTCCCTGGCACAAAGTGAGCTTGAAATGAATTCCCA 
GGTGGCCTTGATACTTTAAAATATTGCCTCCTATAAAATCAACCTTTAGA 
AGAAGGAAGTCAAAGAACATGCTAGATTTCACAAAGGTTAATTCCTTGAA 
ATCCAGTTATCTACAGGACAATGTTGTCAAAGAAAAAATTATTTGGCCAG 
GCACGGCGGCTCATGCCTATAATCCCAGCACTTTGGGAGGCTGAGGCAGG 
TGGATCACCTGAGGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAA 
ACCCCATCTCTACTAAAAATACAAAAAAAATTAGCCAGGTGTGGTGGTGG 
GCACCTGTAATCCCAGCTACACGGGAGGCTGAGGCAGGAGAATCGCTTGA 
ACCCGGGAGGAGGAAGTTGCAGTGAGCCAAGTTCAAGCCACTGCACCCCA 
GCCTGGGCAACAGAGCAAGACTTTGTCTCCAAAAAAAAAAAAAATTCAAT 
GATATTTTTAAATTCATGGTAAGGAAGATTTCATTCAGAACCAGCACAGA 
AGATATAGGAAACACTGCAATGGGACTTTGCGGTGGGGGAGAGAGATTGA 
ACACAACTACATATACAGCACGGGCAAGGACATATTCATAGCCAGGAAGC 
AGAGCAAAGATCAGTGGATGCGAAATTACTAAGAGGAAACATGAAAAATA 
AGGGAGCTTCTGCCTAAACCCACCTAACCGGATCCTTGCTGAAGACAGGA 
CAGGGTGATTGGACACCACTTTGGGGATGGTGGAGGATGGGGAATCCAGT 
GAGATTTCAAGGGTGATCAGATATTGAACATAGAAGGTTCTTGCTAAAAA 
AGGAGTTTACAAGAAAGTGTACAAATGTGCCTGGGAGAAGGTTCAGGAGC 
CTGACTAAAATTTGGTCAAGCAGAGAATATTTGCCAAGATAATAGCTAAG 
TCTTCTGACAAACAATAGATGCTAAGCCAGCAAGGGTGATGTGCTCAGAG 
AAAGCACTGAGGGCTTATTTCCTTTTCCCCCAATCTCCACTCAGTCAAGT 
CTAGTCCCCTTGTCAATGTAGCCATTTGTAAGAATGCAATCAGGCAGGGT 
CCCATCTCCTAGTGACAGGACTGACTGAAGTTCTGCTGAAGAGAGTGGCC 
TGGGGCTGACACCGAGATTTCAGAGTCCTGGGTT7CGCCGAGAGCTCAGT 
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gtagtgccatgccctctctccacctgaacgcccagtgtgggcaGgaacaa' 
ctgcagctagaagtctggcacttacgctggggtctaagacctgcc^gatc 
tgctaactagtcttgtcccttggctataaactgacgttggcacctggcca 
gaaagatgagcaagagatctctgacacacctttaagtccctg7ggagtag 

GATTATGTTGGGGAAGGTCATTCTCTTGACTGAGCAGCAATTTCAGAAGG 
AAGTCCCATGCCGAAGTGAGAGAAGGCAGGGAATCCTGCCTAGTCAGCTA 
GASCAAAACAGTCTGCAGGACGGGACCCAGGGATGTGATCCTCCCATCCA 
AAGGCACTGAACTAAATGACTAAAATACTTTCCAGGGCTCACGTTCTTTG 
AAGAATGGGGACTAAAACTAAGACAGGAGCCAGCAAGTGAGGACTTGGAA 
GGAGATGGCTCATCTGATCAGCCTCCACTCAACAATTTTAATCATCCACA 
CTGGCATGGGGACACAATATGAATAAGTTGACAGGGACCTACTCTGATTA 
AGCAGTGGGCTAGTGCAGAGACCTGTCAGTCAAGAGTGGACAGGAGATGA 
TTTCAGACAGTGAGAACAAAATTAACAGAGTCATGTGCTAAAGGGTGGCT 
GGAACTACAGAGGAGTTTAAGACTCAAGAGGTCTGGCTGGGCGCGGTGGC 
TCATGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCACAA 
GGTGAGGAGATCAAGACCATCCTGGCTAACGCAGTGAAACCC-€ATCTCTA 
CTAAAAATACAAAATATTAGCCAGGCGTGGTGGCGGGCACCTGTAGTCCC 
AGCTACTCGGGAGGCTGAGGCAAGAGAATGGCGTGAACCCGGGAGGCAGA 
GCTTGCAGTGAGCCAAGATTGCGCCACTGCCCTCCAGCCTGGGCGACAGA 
GCGAGACTCCGTCTO^AAAAAAAAAAAAGACTTGAGGGAGTTGTTTATT 
TTTGTTTTCTTTTTAAGACAGGGTCTTTGTTGGGCGCGGTAGCTCACGCC 
TGTAGTCCCAGCACTTTGGAAGGCTGAGGTGGAAAGATCTCTTGAGCCCA 
GGAGTTTGAGGCCACTCTGGGCAACATAGCAAGACACCGTCTCTACAAAA 
AATGTGCAGGTTGAGGCTGCAGTGAGCAGAAAAACACCGCTGCACTCTAG 
CCTGGATGACAGAGCGAGACCCTGTCTCGGAAAAAAAAAGAAAAAAGACA 
GGGTCTCGCTGTGTCACACAGGCTGGAATGCAATGGTGCAATCATGGTTC 
ACTACAGCCTGGAACTCCTGAGCTCAAGCAATTCTCCTACCTTGGCCTAC 
CAAAGTTCTAGGACTACAGGTGTGAGCCACCACACGTGGCCTCAGGAGAG 
ATCTTAATAATAAAAGGACAAATTGCCTTGCATCCCTTAGGGGCAGGATT 
GACACATCCAAGGATCAGGCAGAAAGCCTGTGCGGAGTGGGATGAGCAAA 
GAGAAAGGCTGAGAGTTGTGAAGAGGGAGATGCAGTGCCAGCTAGGACAG 
GCCTTTTTGGGCTATGGGAGGTTTTCAGAGGAGACCCCACCTAAACTAAC 
CCATAACATTGCAGTGGGGACCTGTTGAAGTCATGGACTACTACCTGAAA 
GCCAGAGAAATGGGAGGAGCCTTTCCTCTGAGGAGGGACTCTAGTCCATA 
GGTATCTTGCCACCAAATACATGGACAGGCCCTGGGGGAAGATGGTGGTA 
GCCCAGCTGGAGGAAAACCATTTGCCACCTGAACTAGCCCAGGGTAAGCC 
ACCCAGGCACTGAGGGTGCACACCCATGCATGCACACACAGAATCACACT 
CCTTCCTATTATTCCTCAATTCAGGGGTCTCAACACCCATTTTTTTTGTT 
TTTGGGGTTTTTTTTACATGTTTACATTTTATTTATTTATTATTTTGTGA 
CAGGGTCCCACTCTGTTGCCCAGGCTGGAGCACAGTGCAGTCGTGCAATC 
ATATTAGATTGGTGCAAAAGTAATCACGGTTTTTGTCATTAAAAGTTTTG 
CCATTACTTTTAATGATAAAAACCACGATTACTTTTGACGCAACTTAAAA 
GCTCACTGCAGCCTCAAAATTCCTGGTCTCAGGGAATCCTCCTGCCTCAG 
CTTCCTGAATAGCTGGGACTACAGGCACATGCAATCCTACCTGGCTAATT 
TTTTAAAAATTTTTTTTGTAAAGATAGAAAGTCATTTTGTTGTCCAGGCT 
GGTTTCAAACTCTTGTCTTTGTGCCTCCCTCTGCCCTGTGCAAGACCTTC 
TGGATGCCCACTAATGAAGACTTCCAGGGAGAGGAAAAGTAAACATAGGT 
CCCTGATCAAGGGACCAGGGTTTATCGACCACAAACAGCATGCCCAGATT 
CCACTGGCAGTCCTAGAGGTCGCATTTGCCCCAAGTGTGTGTGGAAGGCC 
TCTCCCTAGCAGTTGGTTTATACACCAGCCACAGCACAGCATATTCTCTT 
AAATTGTGAACATTTGCAAAAACTCCTTGAGGACAACTATCATGTCTTGT 
GTACTTTTGTTTTGTTTCCCTTCCCCTATGTACACGCGCGCGCATGCACT 
CATGCACGCACGCGCGCGCGCACACACACACACACACCCCTCAAACTGAA 
TGCCTGGTGTGCTGAATGGATGAATGGCTAATGTAAGTCATTCTAAAAGC 
TACTTTCTTTGGCATACCATCACCTTTGATTTCATCTTTCTGGAACTCCT 
ATGTTCCCAGATGAATTTGGAAAGCCCTCAGGAAACATTTCAAAATTGCT 
ATATGGGAGAAATGGGAGGGTCTCTCTAGAAATTTACCTGCCACAGGTAT 
TTCTGGTAAGACACAGCAAAGGTGGCACCACCCATTCCTCGTTACAATGT 
CAATGCCAGTCACCTTCCTGTCCCATAAAACTTTATTAAAGGTGCAGAAT 
TCCCATGGAAGCAGGTGGACACCATCTGCTTCCAGCCAGCCAGGGGAGCA 
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AGG7G7CCAC7G7GCCT77G7GGCAGGAAC7GCGC77C7C7AC7C7CCCA 

CTTTGAGGCCTCTGGGGCTGGCCTGCTGCCTCCTCATTGACAAGGCTGCT 

TACTGAGCAGTTCATTCTGAGCTGGACATAGTGCTTCTGGTGAGTCTCTA 

CT7CTATTTAACCCAAAGATATTCTTTCCTAAGGAAACGCTTTCCTGTCG 

GGGGAGGTTAGCTCCAGATGGAAGTCACAAGTGATGGCATGGTAGCTCTC 

ATCCGTTTGGGTGGATGATATTCACGGAGCACCACCATGAGCCAGTCATG 

GAGG 7G AAC AG7 A7A7GC CAGC C CTGAATCAGGTGCATTG ACAG CAAGGG 

AGACAAGCAAACAAAGCTGAGGTTTGCTGAGGATGTTCAAGACTCACACA 

GCACAGAGGAGCATCCACCACCCAGCTTGGGAAAGGACTTGTTATAGAGG 

GGGTGAAGCATGAGCTGAGTCTTGAAAGACTAGAAATTAGCCAAACTACA 

AGGAGGAGAAGGAGTTTCCAGTCAGGAAGAACAGGTTATGCAAAAGCACA 

GAGACTAGAAAGAATATCACATTCAAGGAACTGCAAATAGACAGGAAAGA 

TTGATGCGTGGGATAGGAGAGGAGGGCAGGGGATTCCAGGTGGGCCCTGC 

TTGCCACACTCAGGAGCTTGAACTTATCCACAAAGGAGGTGTGGAACCAG 

TAATGAATGGGTTTTGTGCAAGGGCTTCATGTCACCAGATTTGCTTTTTG 

GAGATACTTCTGTGGCTGATATGTGAGGAAGGGATGGAGGAAGTTTCCGT 

GGCAATCAGGAAAACCAATTAGCAGATGATTCAAATGGCCTAGGGGAAAA 

GGGAGGAGGACTTGGACTACCATGCAGCAGCAGAAATGGAGAGAAATAAC 

AGATCCCAGGCACTCAGGAAGCGCTCAGAATGAGCCCTTCAAAGAACTTA 

TGGTAGGTGATGGATGGATGGAGTGTGAGTCCTGGGATAGCATTGCCTGG 

GAAAATACTTTCTAGTTGAGACAGGGAAGTGGGCCAGCAGAAATGGAGGG 

C77C77C77777GC777AAA7AC7777A7AA7A777GGAAC777GAAAA7 

GAGCAGATATATTAGCAAAAAGCCTAAAAGGGATATTTTTGAAATCACTG 

CTAGTTCTAACATATAACTTTCAGCTTGCACACATCATCAATTAACTTTG 

ATAGCGCCTTTCTGAAACTATCATCCCAAATAGCAATCCTTGTAAAAACC 

TATTTTGAAAAACGGGCCTTGTAGGATAGCCTCACAGATGTTTTGTGGTA 

GATTTTCTAACATTCTAATGTCAGGGAGTGAAAGGAATCCCGTTAGAAGT 

TGGAAAATTCTGGAATCTCTATTCATGGTATTAAAGTTTTGCCGTCACAC 

AAAAGTTTAACACCTTTACACAATCAGACTTCCTCATTTTACATTGCTCG 

GTAATTAGAGGAAATCAGTCACCCAGAGCCTGGGTCCTAGACTTGACAAA 

ATGCACCCAACAAATCCTGAGTGGCCTTGCTGAGGACTTCTCCCAGAAGA 

TAGAAAACTCAGTTCCAGCCAACAAGGGGGAAGCAGCTGAAGAAGTGAAA 

TTAACAAAGTCCTGGAAGGAAATGACCAAATCATCTTTGATTGTGTAATA 

ACCAGAGAGTAGAATACAGCTACGACAGACATTTTGGGAGAGAAGCATTT 

TATCATAGCTTTTAGAAGAGAATATTTTTCAGCATCATAAGCACACAATT 

CCAAGACAGATACTTTCAAGGGATTGTTTTGACG 

>Contig53 

ATGTTNNGGTTTTGGGACCCCATTCAAACTTCATGTTGAATTT7AATCTT 
CAATGTTGAGCGAGGTCCTGTGGGAGGGTGATTGGATCATGGGGGTGGGT 
TCTCCCTTGC7GTTCTCAATGATAGTGAGTGAGTTCTCACAAGACCTGGT 
TATTTGAAAGTGTGTAGCACCTCTCCCCTTCATTCTCTCACTCGTCACTG 
CTCCGCCATAGTAAGATGTGTGTGTTTCCCCTTTGCCTTCCGCCATGATT 
GTAAGTTTCCTGAAGCCTCCCAGCTATGCTTCCTGTACAGCCTGTAGAAC 
TGTGAATCAGTTAGACCTCTTTTCTTCATAAATTACCCAGTCTCAGGTCA 
TTCTTTATAGCAGTGTGAGAGTGGATGAATATAGTGCCATATGTTTGTAT 
TCCCAGCTACCCAGGAGGCTGAGGTAAGAGGATTGCTTGAGCCTGGGAGT 
TTAAGGCTGCAGTGAGCCATGACTGTACCACTGCTCTCCAGCCTGGGTGA 
CAGCGAGACCTTGTTTCCAAAAAAAAAAAACCCAAACTGTGTAAAATGTG 
TTCATAAAAGTGTCTTGCTCCCACACCTGTCCCTATATATCTTATTCCTC 
AGCCTCCGACAACTACTTTATTCATTTCrrTATGTATCTTCCAGAATa^ 
AAAAAAAAATCAAATACAAGCACAGTGGAATGTATTGCCCTTCTTCCCCT 
CCCTTTTGTTACATCAGAGTTAGCATATCATAAATACGGTCTGCATTTTC 
TTCTTTTTCAGCTATCAGCATGTTTTGGAGAGGATTTCATATTCGTGCAG 
ACAGCATGTATTAGTCAGTCCTTGCATTGCTATAAGGAAATACCTGAGAC 
7GCATAATTTATAAAGAAAAGAGGTTTAATTGGCTCACAGCTTCGCAGGC 
TGTTCCACAGGAAGCATGGCAGCATCTGCTTCTGGGGAGGCCTTAGGAAG 
CTTTTACTCATGCAGAAGACAAAGCGGGAGTGGATGTCTTATATGGCAGG 
AGCAGGAC7GAGAGAGAGAGAGAGAGAGAGAAAGGATGCCACATACTTTT 
AAACAACCAGA7C77G7GGGAAC7C7G7CACGAGAACAGCACCAAAGGGA 
TAG7GC7AAACCA77CATAAGAAC7CCACCCCCATGA7CCAA7CACCCCA 
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CACCAGGCCCCACCTCQ^ACATCGGGGATTACAATTTGACATGAGATTTG 

GGCTGGGACACAGAACCAAACAATACCAGAGTGCTTTCTCATTCTTTTCT 

ATAGCTGCCTAGTATTCTATGTCCTTTACTTCATTTAGGCAGTCTCTTGT 

TGATAGACACTTGGGTTACTTCCAATTTTTCCTATTACAAATGATGTGCA 

ATGAATAATTTTGATCATTTTCCATTTCACATGGGTTATGTCCATCTGTG 

GGATAAATCTCCAGGAGTGAAATTGCTGGATCAAAGGGGAAGTGCACTTG 

TGATTTTCATAGTTAGCAAATTTTGTTCTATAAGGGTCATATCAATTTAT 

AGTC C GAC GCGTAATATTTAACAGTGGGGATTTCC CGACAGTTTGACCAA 

CAAGGTCTGTTGTTAAACTTTTGATTTTTGTCAATCTGATGGGAAAATAC 

TAGTATCTCAAAGTGCTTTTAATTTGACTTTCTTATTACAATGTTAAGCA 

TCATTTTACTCTGCCCAAGATCAAATAGTATTTTCTTTTCTGTGAACAGA 

CTGTTAAGATCCCTTGCCTCTTGTTTTGCTGGATTTTTGTTCTTTTTTTT 

CAAATGTTTTGAGGCAGTTCTTTACATGTGAAACAAGTTATCTCTTTATC 

TGGGGTGTGAGTTACAACTACTTTTCCTCTGGCTTGTTTTGCGCTTTGAC 

TTTGCTTCTGGTGATTCCCGCAATTCTGAAAGTGTACTTTTTGCATCATT 

CATTCTTATACACCCATGCTCTTGTTCACGCTGGTTCCTCTAeCTGAGGG 

CTTTTTCTTTTCTTTTCTATCTGGGAACATTTTTTAGAGACAGGGTCTCA 

CTCTGTCATCCACGCTGGAGTGCAATGGTGCGATCACAGCTCACTGCAGT 

CTTGAACTTCTGGGCTCAAGCAATCCTCCAGTGTCAGCTTCCCAAGTAGC 

TAGGACTACAGGTGCATGCCAGCATGCCTGGCTGATTGTTTTATTTATTT 

ATTTATTTTTTGTAGAGATGGGAGTCTCACTATGTTGCCCAGGCTGGTCT 

TGAACTCCTGGGCTCAAGCGATCTTTCTGCCCCTGCCACCCAAAGTGCTG 

GGATTACAGGCGTAAGCCACCATGCCCAGCCCATGTGTGGAAATCTTCTG 

TTTATCCCTTTAGGCTTGATTCTTATGTCGTTCTCCTCCCTCCTTCCTGG 

CTACTCCTCTTGTTCTTTATCTTACTCTACTTGTCATGTTACCTTGTTTC 

TGCTTATAACTAGCTGCCTCTCCTATCTGAGGAGGGACTTGTGACTGTTC 

TCATCTCTGTACTCCCAGGTCCTAGTACATAGCGCTTGCTCAACAGATGT 

TTGGTGCATTGATAGATAAATCAATGGTAGCTGTTAATACCAGTCCTGAC 

TCCCTGCAGTGCTTCAGCTGATCCTGTTCCAGATGTGCACTGAATATCTT 

TCTGTTGAACAACAGAAATAAAGGGGATGGGTGAGGAGGATAGTCTTCGG 

TGGCCAAGGATATTTGTAGGTACTTTGCAGCACTCAGCAATGAGGAGTGG 

GCTTTAGTCCCCCAAGAACTCTCACAGCCCTGTTTGTCTTTACTGTTCAG 

TGTCAAATCCAAGACAAGTCAATGATCAGGAAAGACCTTTTTTTTTCTTC 

AGTGAAGTTTATTTCAGAACCATTGAACAGTATGATATTTGCTCATTTAT 

AAATATTCCCATTTAAATAATCTGAGCTTATATATTTTCAGTCTTAATTA 

AAGGACTTGATTTAAAGAGAGCACACCAGTCCAAATTGAATTGATTCCAT 

AGCTATTAAAAACTAGGCTCTTTTACAGACACTGCTACTTCTTGCCCCCT 

TTGAATAAATTAGACCAATGAATAAAACAAACAAACAAATAAATAAATAA 

ATAGGGAAGCGGTTGCTCATCAGAATGTGGGAGCGAATGACAGAGGGTTT 

CTTAGAACCAAATGTGGCCGTGGTTTCTGTCAGGCGGGCTTTAAGTGAGT 

AGGAGAGGTGAGAGAGGCCTGGCTCAACAAAAGGGCTGGGGATTGGCCCT 

GAAAGGAGAGAGCTGACTGTCCTGGCTGATGGACAGGAGATCCTCTTAGC 

ACTACCCTAAGGCAGGCAGTTGGGCATTGGTGTAGACAACAGGAAAGTCC 

AGGCTATAGCCGTACTCAAAAACCTTTCTGTTCCCTTTCTGCCAGCCCTA 

GGGATTGAGTCCACATTCAGCACAGGACTCTCTGGGTACAGCTCTCTTTA 

GGAAGACACAAATTGCATGGTGAAGTCAGTTATATCCTGGCCGCCTTTGG 

TCCCTCCCAGGAAGACGGGCATGTTTTCTGCTTGAGAGGTGCTGATGTAC 

CAGTTGGGGAACTGGGCAGACTCAAATTCCAGCTTGTTATTGATTTCTAT 

CTTGTTGAAGACAAATCGCTTTTCCATCTTCTTCTTTGGGTAATTTTTGG 

GATCTACACTCTGCAGCGAAAGAGAAAGAAGAATTTTTGTGGGGCAAGGG 

ACAAAAATGCTATGGGAAAGATGTTCTTTGGGTTGGCCAGAAAGGAAACT 

GACGAGCAGGTCACATGATCAGGAGCCACACTCCTGAGTTGTAACTGGGC 

CCCCAACTTTCTGTGTGATTATTAAAAGAGCCCTTCTTCTTTTCTAAAAC 

TTAGTGCCAAATGCTGAGGAGCATAATGTAGGTGAGAATTTTTTTTTTTT 

GGGGGGGTGAAAATTAAGCTAGAGCTTCTTGAAGTACCTAGTTTCCAGGG 

GCTTTTTATTGTATTTTTCCTTATGGTCCTAGAATGACATCAACTTGGAA 

ATGAAGCTTTTGCTGAGAAAGCTGGAGGTGATAGTGGTGGTGATTTTGGG 

AGTGGAGTGGACGTGATAATGGGACCCTTTAAGTCATCTATTTCCCAAGG 

TGTCTATCAAATGAGAGCAGCCCTAACAATATATAATCTGTTGGGGTTGT 

AACTATGGTAGGACATAATAACATCGGCAAAATGATTTAATTTT CTG CAG 
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CAGGATTGAAGGTTGCAuGCAGTTAAAAr* rTATGTTAAATTTATTTACAT 
TAATGCAAAATTGTCAAATAGACCTGTTCCCAGCTTTTCCTAGGGATGGG 
GGCGGGGAGAAGGTGGTTGTCTGGGAATAAGTGGTAGCAGGAGGCTGAGA 
AGGGCTTCATTCCATAGCATTCACTTACCTCCAGCTGTAGAGTGGGCTTA 
TCATCTTTCAACACGCAGGACAGGTACAGATTCTTTTCCTTGAGGCCCAA 
GGCCACAGGTATTTTGTCATTACTTTCTTCTCCTTGTACAAAGGACATGG 
AG^ACACCACTGAAGAAAGAAGGGGGTCTTGTGGTTAGGGACACAGCAGT 
GCAGGGTCACCCCAACCCCTAGGCCCCATGAGTAGGATACATGTAATTTG 
GTAGCCTCTGTGGGAACCCACAGTGAGGTTCCTTGGCCTAAGACACAGGA 
TAACTTGACTTCTCACAGACAATAGCAGGGTCATTTTGTTGATTTAGGGT 
TTCCCCTCAAAGGCCTGAGGGTTTCTCAGAGCCTCATAGCAGTAGGAACG 
GAGAATGAAAGAGGGTCTACATTTTAAATGCTGAAGGAAGGAAGGAAGGA 
AGCCATTGTGTCACTGGCTGGCAATGTGCCCATCCACAGGAGCGGAACAA 
CTTGATCAATGTGGAAGGAAAGGAAAGAGGTGAGGCTGTACTTCTGCCAG 
AAATCAGGCACCAGAACTGTTTCAGGAACAGAGAGTAGCCCATGGGAAGA 
AACTGGGAGAGGAGAGGCTGAGCTGGGAAAGTGGCTCCAAAGAGAGACAC 
TCATTTTGATCTTCCTCAGTCACAGCAGTGTCAATTGGAAGGCCCTGGGA 
TCACTCTTACTACCCGATTCCAAAGAAACAGGATTTTCTTGGCCTGGCTG 
AGAGCAAATAGCTTCCCCCTTGAGTGAGGCTGTCCTTCAAAGTCAGCAGC 
CTTAGTTGCCCACACTCCTGTGCAGAGGCTTTGGCTACTGTGGCACGATG 
CCAGGCAGATCACCACAGCTAATGATGGGTTCACCGCACTTGAAACTTTT 
GCCCGTTACAGCGGAGAGATATAAGTTCCTGCTGGGCGGTAAAATTTCCC 
TACAAGGAACCACCTGGCATTGGGTGGGACGGATGTTGGGGCAAGGGGGG 
AAGACTGGGGAGGGGGATGGACACATTATCGCTCCAGCACTCTTGTTTCA 
GCCTCAACAACAGGAAGAGAGAACCCACAGGCAGTTAGGCCATGTCCATC 
AAATGACCCCATATTGTGGAAGAATTGACATTGCACTATGCCCAAGAGAC 
TTGGGTGGACATGGTCCTGGGAGTGCTTGAGCCGTCTAATTTCTCAGGGT 
CACACTCCTGTTAACAAATGCACTGGCCAGTGCAATCAAATGTGCCATTT 
CTAGGACCAAAGTTTGTATATTCCTTTTTAATATTTTTTTTCACTTGTGT 
TGATCATTTGCCTTAAATTAACTTTCTACTTTGTTTAAAACATGGAGAAT 
TAGCAAGCTGCCAGGAAGCCAGGCAGGGAAACCAGGATGTTTCCATTTAC 
CTTGTTGCTCCATATCCTGTCCCTGGAGGTGGAGAGCTTTCAGTTCATAT 
GGACCAGACATCACCAAGCTTTTTTGCTGTGAGTCCCGGAGCGTGCAGTT 
CAGTGATCGTACAGGTGCATCGTGCACATAAGCCTCGTTATCCCATGTGT 
CGAAGAAGATAGGTTCTGAAATGTGGAGCACATGTTGTTTAGGTATAAAA 
TCAGAAGGGCAGGCCTCGTGAGGCAAGGTGGCAAAATTTGATTTCTTGGA 
GGACACCTGAGCATATACGGTCAAAGTCTGATGACAACACCAGTAGGGAT 
GAAGCTGGGAGTGGGGTGGCTAAGAACACTGGACCTGACACTATTAGACA 
TGGGTTCCAGCTTCAGGTCTATTACTGCTCACTGTGGCCGAGCAACAGAG 
CTACTTAGGTAAAATGGTGATGGTCATAACACTAGCCCACAGGGAGGTTA 
CGAACCTCTGGTGACAATGTAAGTGAAAGGCCCCTGAGAAAGAGTGAGGG 
AGTTGCAAATGTCAGTAGCCATCAAGATCTTCTTTAAGAATAGTTTCCAC 
TAAAGAGATGATTGCTTTGGTTTCCAGCCTTCTTTGTTTTGTCTCCCCGC 
TGGGCCTTCTACCTTTAAAGGGCTTTGGCTCTGGGGGAATTGAGTTGGCT 
GGGGCTTGATGACTTCCAAGAGGACACAAGTGGAGATCTACTGCCTGCTC 
TTGGCTAACTACCTTCTTCAAAGATGAAGGGAAAGAAGGTGCTCAGGTCA 
TTCTCCTGGAAGGTCTGTGGGCAGGGAACCAGCATCTTCCTCAGCTTGTC 
CATGGCCACAACAACTGACGCGGCCTGCCTGAAGCCCTTGCTGTAGTGGT 
GGTCGGAGATTCGTAGCTGGATGCCGCCATCCAGAGGGCAGAGGTCCAGG 
TCCTGGAAGGAGCACTGCGGAGAGAGCGAGGGAGGGAGCCTGGTGAGGTG 
GTCCTGCCAGGAACCATGCTTTGACATCAGAGAGTAGAAAGCTCAGAGAG 
GAGGAAAGGGCTTGAAAGAATCCCGAGCTTCTAAAGATCATCCCTCTCTG 
GGCCAGGCGTGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAAGCCGA 
GGTGGATGAATCATTTAGGTCAGGACTTCAAAACCAGCCTGGCCAACATG 
GCGAAACCCCTTCTCTACTAAAAATACAAAAATTAGCTGGGTGTGGTGGG 
GTGCACCTGTAATCCTAGCTATTCAGGAGACTGAGGAAGGAGAATCGCTT 
GAACTCAGGAGGTGGAGGATGCAGTAAGCCAAGATTGTACCACTGCACTC 
CAGCCTGGGCAACAGAGTGAGACTCTGTCTCATAAAACAAAACAAAACAA 
AACAAAACAAAATAAAATAAAATAAAATAAAAAGAT7ATCCCTCTCTGAA 
GCTCAAGGAGGTTAAGGGTGTACTCAAGGGCACACAGCAGGTTAGAGGCA 
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GACTCAAGAC2TAGAATGTGGGCTTTCTCJACACCTTACAGGCTATTdTTTT I, 

AGAATAAATCCCATTTCTACTTTGTTCATCTTTTTTGTACATGCCCCACC 

TACACCATACATGTATACCTTCTCTATATCTT7TTGTATCCCTAATGCTG 

TCACACTATGATTTGCTTTTTCATGCAGATGACCATAACATTTTCCATTC 

ACCTATGCTCACTCAGCAAGTATTCAATTTTTCTACACTGTTCTTTTTTT 

TCCTTTTTCATAACACTGTCTCATAGGCATTCTGCAAATCCTGTGAGAGT 

ACJTTTTGTGAAATGTTACCACTTTCCTCTTATTCAGAGAAGCTCCGTAT 

TAAGGCTTCACTGAGGTTGCCTTAAGGCATGATAATGGTTCAAAGGCTTG 

AAAGACAGTTAAAGAGACCTGTAAGTGCACAAAAGAAAGTTGAGCAGGAG 

AGAATTTCTTGCCTGGAGCAGAGCCAAGCTACTGGAAGAGGCAATGGGGG 

CAAAGGCCAGGCAGACAAGCCAATGGGCTCCTCCCACAGCTGCAGCCAAC 

AAGTTATGCCAGTCTTAAAACTTCTAAAGAAATATGTTTTTAACAAGATT 

GAGGACTGGATTATGAGGCTAGGGGAGGCTATCACAAACTGGAATAAAAT 

AAAGCCAGAGAAAAGTGGCTGCCTTCCAACCTGCACAACTGACCTAGCTA 

GGCTGATGGCTGGGCCACCTAGGAAGGCTACTGAGCATCATATAAAACAG 

AAGGGACAGCAGGAATATAACATGGCTCTTTGTAAGGATGAGTCTGAAAA 

ATGACCATTTGCTGCCCAAATGCCCTTAGCTACAACTGAAAATATTTCAG 

AACTGGAGGTTGCAGGATGCTGGAATCTCAGAGATCATCCAGCTCAGCCC 

TTTATTTTTCAGATGAGGTCCAAAGCGGGTAAAATGACTTGTCAAGGTCA 

AACAGCAAGTGAATGGTTTTCTTTCAAGTCTCAATTCATCTTTTTGTTTA 

TATCATCTATGTCTTGTTGTTATAAGCTTCACCCCAGGTAGCAAAAAACT 

ATTCTACTCAAAAGGGGTAGACATATGTTAGTTCTCAAGATCATCTCTTG 

GTTTCAGAGTTTAACTCAAGTGATTGGCATAGGCTGAATCCATCTCTTAA 

AAGGATAATCAAATTTATGTTGAAGACTTGGTTGTCTTCCTACTATGAAA 

TGGGAAACATTATCACTACTCCTCCCCTGTCACCACCAAGTGTGGCCACC 

ACCACCAACGTTAGTGAGTGACTGTGGTGATATGATGACCAAGTGGCCAG 

GTCAGCAAGTGGTGCAGCCTGTGTCTCACTGGAAGAGGTTAAAGTCTTTC 

TAAAACAAAATACCATGGCATCAAAGTGGCCCAGAACTCCCTTCTTTGAG 

CTTTCCCTGTGTTAGAGCCCTTCCTTGGGTTGGGAGTTAAACCCATAGTC 

TTACCTTCATCTGTTTAGGGCCATCAGCTTCAAAGAACAAGTCATCCTCA 

TTGCCACTGTAATAAAAACAGGGACATGTCTCAATTATGTCTTCTAAACA 

GGTTTATTTTTCCTTCCCTGTGTACAAGACTTGACTGTTCATAAGAAACT 

GCAAACAGCCTGCCTCTCAAAGCTGCCTGAAACACCTGGCAAGTTTCACA 

GTGATATGCGCAGAACAGTCCAGAAGGCAGATTCTAGGCCTGGCAGGTGG 

GCACCCTGGGTGCTCCCTGTTGGATCTTGAGGCCTAACCTCTAGCCCAGC 

AGAGTCAGCTAAAATCTGAGCTCTCCCTCTCCCTCCAAGCCACACTTTGC 

AAAGGGATTCCTTGTATTGTGGGCTTGGAATCTTTTCTCCCCATTTGCCT 

CTGCAGGAAGCCCTTGCAACAACACATCTGGATAGCCTCCAGGTCCCAAG 

GCTGGAGGGACTTGTAATGGGAAAGTAGTCTTTAAATCAGATTTACTTGG 

CACCCTGTTTGCCACTGAAAGAGGCAATTTAGGGGAAAAATCTGGTCTCC 

AAGCACAGATAACACTCTACTCTTGAAAGAGGAGACCTGCTCATGTTACT 

GGTCTCAGCGTCTCCACTGACCTGTAATAAGCCATCATTTCACTGGCGAG 

CTCAGGTACTTCTGCCATGGCTGCTTCAGACACCTGTGTAAAAAGGAGAA 

AATGAGTGACTTCCCCATGACGGCTACGTTCATGTGTGATTTCTCTCAGC 

ATCCAGTGCATGGCAGTCATGCAAAGAAATGATCTCTGAGTAAATGAATG 

AATGTGTGAAAGAGAAGTCCTTTGGGTCTAGAGAAAAGCATTTGCTAAAC 

CAAACCCCAACTAGCAATGTATTGGCTAGGAGAGCTGGAGCAGAGGCTTT 

GACACTAACCTTTAGGGTGTCAGCTGTTAGATAAGCAGTATCCATTCCCA 

GAATATTTCCCGAGTCATAAGCATTATATTACACCTGGCATTTTTGCAAA 

AAGCTGAGAGAGGGAGGCAGAGAGGGAAGGAGAGGGAGAGACAGAGAAAG 

AAAGAGAGAGAGAGAGAGAATATGCATACACACAAAGAGGCAGAGAGACA 

GAGAGACTCCCTTAGCACCTAGTTGTAAGGAAGATTAAAGTCATACTTGA 

GCAATGAAGATTGGCTGAAGAGAATCCCAGAGCAGCCTGTTGTGCCTTGT 

GCCTCGAAGAGGTTTGGTATCTGCCAGTTTCTCCCTCGCTGTTTTTATAG 

CTTTCAAAAGCAGAAGTAGGAGGCTGAGAAATTTCTCTGTTGAATACCTG 

ATTTCACAATCAAGTTAAAGGAAAGGGGAAAAGAGTATTGGTGGAAGCTT 

CTTAGGGGAGGGGACTAATAAACTGAGATAATTCTCTGGTTCATGGAAGG 

GCAAGGAGTAGCAAACTATGACACATTTTGCAAATGTATCACCATGCAAA 

TATGCATTGTTTTCCTGACAATCGTTGTGCAGTTGATGTCCACATTAAAA 

TACTGGATTTTCCCACGTTAGAAGAATGTTTAAATTTAGTATATGTGGGA 
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CAAAGTGGAAGACACACAGATTTATACA * GCACATACTTTTCTTCKTT CA ' 

CTTCTTTGTACTTAAGTTTAGGAATCTTCCCACTTACAGATGGATAAATG 

GGTACAATGAAGGGCCAATAGCCCTCCCTGTCTGTATTGAGGGTGTGGGT 

CTCTACCTTGGGTGCTGTTCTCTGCCTCGGGAGCTCTCTGTCAATTGCAG 

GAGCCTCTGAGGAGAAAATTGACCTTTCTTGGCTGGGGCAGAGAACATAC 

GGTATGCAGGGTTCAGGCTCCTGACGGAGTTGGGGCAACCCTGGAGATAA 

GCTCACACAACCCTGCAAGACCAGGTGCTGTTACCCTAGCCAATCTCATG 

GATGAACCAGATCAATGCCAGATGAGCTCTGCCTAAAATGATTTTTTGGT 

GAACTCTGAAAAGTGGAATATTGTTTCTGTAAGAATATCCATCTGAGACT 

CTATCTCTTGGTAATACCAAGAGTTATCAGTTTCTCTTTAACCGAGACAC 

CAGCAAAGTGCCTGCTCCAGGGTACTGCCCAGGGGAGCCCTCCATTTGTA 

GAATGAATGAGAGTCCAGGTTATGAACAGTGCCTGGAGTGTAGGAACACC 

CTCCTTTGCCTCTTTGACAGGTCTGCATCATAACACTTTTTTTTTTTTTT 

TGAGACAGAGTCTCACTCTGTCGCCCAGGCTGGAGTGCAGTGGCACGATC 

TCGGCCCCCTGCAAGTTCCGCCTCCCGGGTTCACACCATTCTCCTGCCTC 

AGCCTCCCCAGCAGCTGGGACTACAGGCACCTGCCGCCACGGXICGGCTAA 

TTTTTTGTATTTTTAGTAGAGACAGGGTTTCACCATGTTAGCCAGGATGG 

TCTCGATCTCCTGACCTTGTGATCTGCCCGCCTCGGCCTCCCAAAGTGTT 

GGGATTACAGGCGTGAGCCACCGTGTCCAGCCTGTAACACTTCTTATAGC 

ACTGAGTTGAAACCTTGCTCCTCCTGGTTCCTCCAGGAAACTGAAATCTT 

TTTGAGCCAAGTCTAGCACAGTGCCTGGCATGTACATTCAGGTGGTAGAG 

TTTGCTGCTTGAATGGGTGAATGGGAATTTGACAGCATTTTTATTCAAAT 

TAGTATGTGCCAGGTATCGTGCTCGCTCTGCATTATCCAAGGGAGTGAGC 

CTCTGTGCAAGTATTTGAGACACGAGGGAAATAGG7TCTACTGTGGGAAA 

AAGAGCATTTCATGGACTTGCTCTCCAAGCAGCCTTCTGATTTTTAATTT 

GGCTCCCAGTATCTTGATATCAGGAGTCAGTCACAAGAACTCCATCTTTA 

GTAAGTTATATTTTCCACAGGAAATCTAAAAGCTGTTCAACATGTTAGTT 

TCCTGTGAATTTGATAAGCCATAATCCATTCCTAACACTGAGCCCTCCTG 

AAATTTGGTGTCTGGTCCTGCAGATAGCTAAAAGCCCTGTCTGGGTGGCC 

TAGGGGACTCCTCTGTTTTGCCTCCACAGGATCCACTTTGCAAATTAACC 

ACTGGTTCTCCCGTTGTAGGAACTGCCACCTTCCTCAGAGCCTGTCTTTC 

TTCCTTCCTTCCTTCCTTCCTCTTTCTTTTTCTTTCTCTCTCTCTTTCTT 

TCTTTTCTTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTT 

TCTTTCTTTCTTCCTTTCTTTCTCTTTCTCTCTTTCTCTCTTTCTCTTTC 

TTTCTTTCTCTCTCCCTCCCTCCCTCTCTCTCTTTCTTTCTTTTTCTTTC 

TTTTCTCTTTTCTTTCTCTCTTTCTTTCTCCCTCCCTCTCTCTCTTTTTC 

TTTG7CTCTCCCTCCCTTCTCTCTCTCTTTCTCTTTCTCTCTCTCTCTCT 

CCTAGACAGGATCTACCTTTATCCCCCAGGCTGGAGTGCAGTGGTACAAT 

CATGCATTCATTGCATGATCACAGCAGCCTCAAACCCTTCCTCAGAGTCT 

TTATGCGGCAACCAGCAGGGTCTGGAGGGTTGGTGGCTCTGTGAACTCTC 

CTGACAGAACACAGAGATGTCTTTGGTCTGTTGATGTGATTACAAGCTGA 

ACGAAGGAGGATCAAAGCCAGTGACAGGAAGGGAGATATGCAAGGGACCC 

GAGCATCAGCTCTGAGTTAGTCCATTCTGCTTCTGGGACTTGGGATACAG 

GTCAGAAACCTTGAGCTTCTACTTCTCCATCTTCCAATTGTAGCATCCAG 

GACCTCAGAATCTGCCAGCTAAGAGGAGCCGTAATGATTGTCTGGTGGGA 

TATGGTGGGACCACAGAGATGAAGACATGAATAGCTATTTGAATGTGAAC 

AGCAGACGAAGAAATCAAGGCTAGGAGGGTGGAAGTGACTCATCCAATAG 

CACAGTGTGGTTGAAGCAGCACTAGTATCCAGGTTGCATGAGCCCCTGAT 

GCTTTCGCTCGAGGGAAATTTTGGAGCCATGGGGCAATGCCCCCTGACGT 

AACAGTCTCCACAGTTCTGCCATGTCTCATCCTGGCCCTGTAACCTGGAC 

CCAAATCTGCTACCATCCCATCCATCTCAGGAAGTGAAACCTCTTATGTC 

AAATAGGTTGTGCAACGTATGTATCAGATCCTGTCTTCCCAAGGAGACCG 

CTCAGGCCACAGCACTTCCTTCCGATCCCCAATGAGCAGAAAATATCTCG 

CTATAAACATAGTTGGCACTAAGGGAGGGAGTGGAAGAGTGATGATGATG 

TAGATGGTGATGTAGCCCCAAGGAAGTGGAACAAGCAGAGATGGGGAGCT 

GGAAATGCCAGGATGCTCCAGCTTTTGGGGAATTATTCAGCTCTTGAGTC 

ACTAAAGCCTTTCTCAGCTGCAAGTTCCTCTTTACCCTGTCAGGTCATTC 

TTCCAAGACAGGAGACTGACATTTATTCAAAGCAGCAAGTGCCCTGATAC 

ZATCTTGTG7CTAATCATGGGCTTCGCAGCCAGTTATCAAGGTTGATCTC 

ATCTCATTGGTCTTCAATCATTTTGAACAAGAAGACAAGCAAAATAATCA 
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TGGGTTAGTTCTTATATTATTGTGTGTACATGCAGTGATGTCTG7TCTTT 
GTAGTGAGCTGTTCCTTCCTTGTTCACCCTCTTGCTTAGAACAGAACTAA 
GCAATCTGCCCCCAACATTTTCCCCAATTTCCCATCTCATTCTTGGCACT 
GGCTTCCTAATATTTGTTCTTATGAGTCATTTTCTTGTATCATTTCCATG 
AGTCCCTCTGGGATCTTAAAGTATGAAAAATGTTGTGTGTACCCACACCT 
GTCTTTGTGGATATTTCTCTCCTTTCCCTTCTGCTTCTGGGATTATTTGG 
GA^TGGGCACTATGATTTTTATCATATCGCTTCCACTTCCTTTATGGCAT 
CATCTCCAATGGGCTTCTTCTCCCTCTTGGATCCAGGTTCTCAGATTGGG 
GACATGCAGAGTCCAAGGAACATTCCATTCTCCTCCCTGGTCTAGAACAA 
GGAGGGCTTAGATATATGAGCAGGTGGCTGGGGCTGGCGAGCTATGTAGT 
CTCCAATGGCTTTTCCCTGATGTCGGAGTTGTTATGTCAGTTCTGGGAGA 
CCAATAAGACCTTGTCCTTCCTTTGGATCCATCAGAAAAAGCCCCTGGGT 
GGGTAAGATGGATGGCAGGGCTCTCCTACTCTATGTCTTTTCTCACACCT 
AGTGGGTATAAGAGAGGGGACCACAAACAGAGGGGGCTCTGGTACCACTT 
ATCCAGGGTCTGGAAACATTTTCTGTAAAGGGCCAGATAATAAATGTTTC 
AGGTACAACTACTCAACCTTGCATC^TTTCAGAAAAGCAGTCAGATAATA 
CATAAATGAATGGGTGTGGCTGGACTTGTCCTGCGGTCCCCTGTCTTATA 
TCATTGTATTATATCATTTTTTCTTACATACAAATTTAGAAGCAATACTT 
AAAAAAAAAAAGCCGTCCTTTATTGAGCACCTACTAAGTGCCAGGTACCT 
TTTTTTCCCTCATTATCTTATTAACTCTTCATAATAACCTTTAAAGTAGA 

taatattgaaccatttgacctatgcagaaactgaggttgagacaatXaat 
tatttaagaccgcacaaacagtaaatgctggaactacgactcaaatatgg 
gttaactgaaccaaaaccagatctttatttctcacttttaattgttagat 
atgtttattgcctcatctcctgtccacatggtgcccatcggcagactcct 
ttctcattctcagtgattgagtgacattctaaactacattggcctggcag 
attcacctctgtcccctaaatgtttccacattgtccttttaggattgaga 
tcctctctgttcccttgtcttccctcctttcttcttctggcggtgacgtg 
ctgtgtgaatttgtttctttctcctctcagggtagtactgggactttcca 
aatcagggtttttaatgatctctcttcncttttctgaatttcttccttat 
tcccattcactttctcatctataagtggcanctttgttgctggaagatat 
cccttgtgcagggattnctctttaanaatttgtcnnnacc 

>Coiltig54 

GTGATCGTCAACCTCCCACCCTGTAGGGCCTCAAGCATTGAGGACAATCA 
CTGGCTGCCCATTAACCCAGAAATGTTGCCGAGACAGGAGGCCGTGGCCC 
AAGTTCCTGGAATGGGGTATTATTATGTCAGCACAAAGGCCTTTGCACAA 
ATGAAGGCTTTAAAAATGCAGTCCTAGTCAGGTGGAGGAGGGCTTATAGG 
ATTCCCAGGAATCTGGATCATTCTCTTGAGAGCTTTCCCTTGTCTCTGTT 
AAAACTCACATCGTACGGCCCAAATAACAACAAAAAATGGATGTAAATTC 
TTGAAATAACTTGTGGATGGGGGAACAAGGCCCACCCCCCAGATCTGCCA 
GAAGCTTCAGGTGAGGGTCCCAAATGCCAAAAAGTCTGGTATCAGAGAGG 
ATGGCCAGTGACNTGGGGACACATGCCCTTTGCTGTGTCACTCAAGGAGC 
AGCAGCTTCGGCCCCGCACAGTGACCAGGACCCTGGCTTCCCACGCTGGG 
CAGGAGCTGGTGTCTGATGAAGGGAATGCCTGGCAGCACGTGCTGTCTGT 
CTCCTCGTGTCAGCT TACCTG GCTTTGCTGCGAAGAGGCCACTTGCATTT 
CTTTATTTTTTATATTT1TITAATTTTTTAAATITTTTATTTTATTTTTA 
TTTTTATTTATTTATTTATTTTTAATT TTT 

CTTTAAGTTTTAGGGTACATGTGCACATTGTGCAGGTTAGTTACATACGC 
ATACATGCGCCATGCTGGTGCGCTGCACCCACTAACTCGTCATCTAGCAT 
TAGGTATATCTCCCAGGTTAATCCCTCCCCCCCTCCCCCCACCCCACAAC 
AGTCCCCAGAATGTGATGTTCCCCTTCCTGTGTCCATGTGATCTCATTGA 
ATTTCTTTAAAGGTGGAATCTCTCAGTGGGGTCTAATCTGTTCAGAAATA 
TCAAAAGAGTATCCTTGGGAATGACTGGAATTCCAGAGTCATCTGGTAAT 
CCTCATAAAACAACTCCTGGATGTCTCTCAGCACATCTCCCACCTTGAAC 
GCAGGAGGCTGGTTCAAATGGAGGAGCATCGCTCTACTGCACTTTTTTTT 
TTTTTTGGCCTAAAGTGCAAAAGGGGATACGTTTCATGTAAATAAATCAA 
CTGCAAATCGCTAGTTATGCTGAGCCCTGTCCCGTGCTGTGGACACAAAG 
GAACCAAAGGCTTTTCTCCCCGCCCAACAGACACATAACACACACACAAA 
AT CATAAAAACATACATAC C C C CAACACAT AACAACACACAACACACACA 
ZAAAATATATACACACAACACACAC CAAACATG C C CACAAAC CTGTGT C C 
AAAAATAAATCCTACTGGTGGGTTTGTGGTCTCCCTAACTTCAAAAATGA 
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AGCCGTGGACCTTCGCACjTGAGTGTTACAGCTCTTAAAGATGGCATGGAT 

CCAAAGAG7GAGCAGTAGCAACGTTTACTGTGAAGAGCAAAAGGACAAAG 

CTTCCACAACCCAGAAGGGGACCCCAGCAGGGTTGCTGGTTGGGGTGGCC 

AGCTTTTACTTCCTTTTGGCCCCTCCCATGTTCTGTTTCCATCCTATCAG 

AGTGCCCTTTTTTCAATCCTCCCTGTGATTGGCTACTTTTAGAATCCTGC 

TGATTGGTGCATTTTACAGAGTGCTGATTGGTGCGTTTTACAATCCCCTT 

GT5AAGACAGAAAAGTTCCTGATTGGTGTGTTTTACAATCCTCTTGTAAGA 

CAGAAAAG7TCCCCAAGTCCCCACTGGACCCAGGAAGTCCACCTGGCCTC 

ACCTTTCAACTCCATAATGGCATGAAAATACATATGTTGTACAAAACATA 

CATACACAAAGTATACATGCATCTCCCCAAATATACACATACCACAGAAA 

CATACACACAGGAACTCAGCTACCTGTCAAAAGTCTGCATGGTGATTGCC 

TCTGCAGTGAGTAGTTAGAAAAGTGAATTTGTTTTTCAATAAATTGGAGT 

C CTTAAAAATCGTTGTAAGATAGAAAATTTTTAAAAGTATATAAAATAAA 

ATATGTATGTCCTTTGGTCTAGCATTTACACATGTAGGAATTTATCCTAG 

TGGAGTAATCAATGATATATGCAAAGATTTGGACAAGCATATTAAGCACA 

GAATTATGTATGCATATGTGTGTGTATATATATATATATCT©^TACATAT 

AATAATGTAAAAGTGAAAATAACTCAGATGTTCAAAATTGAGGATTAGTT 

AGACTATGATCTGTCCATATGTGACATACAAGTTAGCTGCCCCTTATTCT 

CTCGAGCTTCAACCTCCTATAAACAGTGTCCCTTGTATATCAGTATTGGT 

ACAGATAATCGAACTTATTGAGGTTTTACATGGGGCAATAAAGGCAAGAG 

TTTATGAATACTCCATACTACACTAGGTAGCACCCCCTATTAAAGACAAA 

CTCTTCTCTCTCATTTCCCTTCCTTTCCGGAACCACTTGGTTGAATCTCT 

ACAAGTCTCTATTGCAACTGCCTCAACATGGCACCCTCCCTGCATCTCCA 

TCTTCCCTGTCCTGAGAGCAATGGCCTGCTGCCCCCACACTCACATCCTC 

ATTCATTCCAGAAGTGAGCACCACAGAAGTGCCTACAGTTACCCCAACCA 

CCTTCTTAGAAGATAAGTTAGTGTTTGTTTTGACTTTTTAAAATTTTTAC 

TTCCTCTTTTCCTTCACAATCTCATCCCATCCCAAGAGGTTTATCAAGAA 

GTTCTCTAAAGATATGTGTCTCCTTATGGAATTTAACAGAAATCAGGGAT 

TTGTATTCTAGCCATCAAGGGAATAACATTTTTCCAGGTCTTTAGACAAA 

TAATGGAATACCTTGCAGTAATTAGATACACTATTGTAGAAAAGTATTGA 

TGAAATGGAACGATGTTTGAGATATCATATTGAGTAGAAAAGGCAAGATA 

CATTAAGTAGGAAATGTATCTTACAAAATAATTTGTCAGACACACTCCTA 

TATTTGTATGTTATATAAATGCGTATGTGAAGAAAGGCTAGAGGATGAGA 

CCACAGTCTTCGGTGAAGTTTAAGAGATGAGGCTGCAGCATGCTCAGAAA 

GGCCTGGGTTATAGTTCTTCCAGTAATTAAGGATGTGATCTTGGGTAAAT 

TGTCCATCCTCTCTAAACTGCACCACCTTTTGTCTGTAAAACAGGAAGGA 

TGGTATTTACCCCCAGGGTCATCAAAGGATTTGGTTGGAGAAAAATAAAT 

AAATGGGCTGAGCCCAGACCTGGCACAGTGAGAGCACAGTGGTTGACTAT 

TGTGCTGGCCTGTTGTTCCTGTGTTATTGACATGCTGCTGGTGGTGGTCC 

AGAAGCTATTACCTTAATTGGTTATGTGGATTTCCCCTCATACTGAGCAG 

CTGTGTGTGGTGTTGTAAAACATAGCCATACACAGTAACTGACAAGGGCA 

AATGTGATGGAAAAATGCAAGGAAGTGCAGATAAATAGCTAATGGGCTGT 

AGAAGGAAGCTAGTCCTTGGAGGGCTTGATCAAGGAAGGTCCTTTTGCAT 

GTCACCTTTGAAGAAGAGGGGACATAGAAGAGGTATAGTGCATCCCGGAG 

TGTACCTGGAAGGGAACATGAAAAGAGGACATTTTTCTCTGGGACATGGG 

GACTCCACTTGCATGAACTCTGGAATTGGGGCAAAGAACCATCATGAGAA 

CAAGGGCTTCCTTGAACCTCCCAGGCTCATTGGCTGATCTAAACCCTGTG 

TCCCCTCTTTCCTTCACTCTCCTCTGTTTTCTATACCTGTATTATTGGAC 

TGGACTGGAAGCCACCTGATCTATCACAAGTACCTTGAAATGTGTTGAAT 

AGGTGTGGCACAGTCCTTAGCAGAGTGGCACTACCCCCACAGGAATTTGT 

TTATACCTTTGGCATGGAAAATAGCAGGAAATGAGTGATCACTGATAACT 

GAGGATGCTATTTATTATTGGCCAAAGGAATACTTGTGTTGTATTTGCAT 

AACCACTCACAAACTGTTGATTACAAATGAGTACCAGACCTAGCTCCTTC 

AAGT AAAGGAT CCTGAGAACTGAAGGCAAACAGAGCT C CAGGAGT C CAAG 

ACAGAGCCACAGACCACGAGGATCCCTGGCCCAGGTAGGTGGTCCTCCTG 

CACTGGCTTTCAAGGCCAACAGGATGGATGGGGAAGTAGAGTAGCATCTG 

GCCATCTAGACCCTTGCTTTTTATCCCCZACTGGAAGCACATCTGAATTTC 

TAAATATGATCTCTGAGACCTGCCCAGAACACCTTGCTCTCAGCCCCAGT 

AGCAGCCTGCTCTCTCCCAGGAGGGCTTCCACTAACAAGTAGGGCATTGC 

TGGAGGGCCAGGCAGACACTAGCTTAGGAAATCCACCAACCCTGGAAATG 
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C7AG7CCC77CTC7GAAGGCTCAGAAGAC7GACT77AGAG7C7AGAAAA7 
ATTGGTCCTTGGGAACAGATTTTGAGTGCAAAGAGATGGACTTCAGATGG 
CCAGATGCACTGCTTCTTTAGGGAATTCTGTGAAAGCTCCC7GCATTTAT 
CTTAATACAGGCAGCAGATTTCATGAGTACCCCCGAGGGATGGCCCCAGG 
TCCTCCAGCCTGTGAGCATCCTTCTGTCCTTCAGCAGCACCACAGTATCT 
TTATATG7CTTTGGATACCTACGTTTCTGCCAGACATCTCTTGCTCTGAT 
GXTCTGGCTGCCAAATTCTCTGTCAAGCGCCTCCAATTTTTTGTG7CCTT 
TGATT7ACCCCAACATGACAAAGGCAGTTGTGCTTCATGTATTCAGGGAT 
ACTGCCAAACCACAAACAGG77AAAA7CAAA7AGCAGA7A7CCC7G77CC 
TAAAGACCCA7CAGCTCTACCCACC7GC7CCTGCTCACCG7CC77A77G7 
7GAG7CC7GAAGCCC77CCT7GTCAT7777A777777GCA7GAACAA777 
AG77CCC77TG7C7CAC7CC7AAACC777C7CAAAGGA77GGA77TG7AC 
ACAAAC7GCC7A7C7C7GCAATC7TAGAAGTGA7A7GA77C7GAACAAA7 
CAC7TAAC7777GATT7TTTATTGG7AAGA7GGGAA7ACCAA77777GC7 
CCAC77C7G7CC7A7G77GGCCTGGGC7GA7G77GAAAGC7C7CGG7CAA 
C7GAGA7AGGG7G7GCAGAATTTA7A7ATATAAA7ATATCT€C7CCAACC 
CC7CCCAA7GAAGCAAG7CACGTGAGTCAA7CC7ACCCTAAGA7A7TAGG 
GAT7GAGCC7CC7GGGACATTTGG7GGC77AGG7777CA7GAAAAGAGG7 
7GCAGAGCAAC7GCTT777GTTAGGCAAAGA77AGGC7AC7GCAGAGAC7 
CAGCAAAC77C7A7AGAAGGTGTCAGA7GG7AAG7A777TAGGC7TTGC7 
7GCCAGA7GA7C7C7CAAC7AGT7AACCATGC7A7TGTAGCC7CGAAGCA 
GCCAGAGACAATA7G7AAACAAGAGCA7GGC7G7G777CAA7AAAAC777 
A7T7AAAAAAACAG7CAGGGACCGGA77TGGCCAAAGGCCATAGTG7GCC 
AGCCCCAAGAC7AGAGCAATGCAC7T77AAC777777A7777A77T7TG7 
AAAA7GCCAAGA7CCACAAAAATGC7A77GCACCCCG7G7GT7AGCAC7G 
7GAC7CAAGGT77GGGAAATTCTGC77TGAAGGCGTGA7AGACAGGAGAG 
CATGG7C7GGCCCCTTGGTGCCTT7C7GGT7GCAGCGAGCAT77CAAACT 
ACAGAGCAAGGCCAGTGG7CTGT7CAGCACTAGAGACATGCAGCAAGGTG 
7CC7GGGG7GAGAAGA7GCCATAAC7GGTCCCCT7TCTA7CTCCTTAGG7 
CTTGGACTTCATTCCA7TTTC7GTTGAG7AATAAACT.GAACGTTGAAAAT 
GTCCTTTGTGGGGGAGAACTCAGGAGTGAAAA7GGGCTC7GAGGACTGGG 
AAAAAGATGAACCCCAGTGCTGCTTAGAAGGTAAGGTTC7TGTAGAAATC 
7ACC7CAGGGCCAAAG7G7AATTCC7AGAGCAGAACTTTGCTAGGTGC7G 
7GCACAGACCCAGTTG7TTCCTGCTGAC7TGCACAG7AAG7GAGC77TCA 
AAT77CCC7GGACAAA7AACTAGACAAGAGAAAT7C7GGAAGAGAAAAGG 
AAGC777GC77CAG7G7CCAGGCACA7CAGG7AG7AGA7AAAAGGA7CG7 
CC7CACC7ACAGA777GGGGCTTTAGCA7CCTGTTTGCCAAC7GGATGG7 
7GCA7A7GC77CAAAA7GCACCTC77CCCTCCCAACATTCCCAAG7GGAA 
GAGAAGCC7CCGA7GAGAAGGAAC7CTC7AAGGC7GGGCTGAACAAATGA 
CCCAGGCACAGGGCATC7GAGTAT7CCA7GAGGAACACA7T7GGGTGT7G 
C CCA7GGGGGACAATAGGAGGAGGCTTTTGACCCAAA7GA77G7CTACTG 
AGG7G7GACGGGAGAGGCCTGTGACATGCCAGAGGCCAAACCCGTGATCC 
AGTTCA7C7C7A7TCTATGTTTCTGAAGAGGGAAGCTATGA77TAATG7C 
A7TAC7A7CATGC7GC7CTAGTAT77C7CAGCACATACACAGAAGAGGGA 
A7TAAA7GG7CC77GATACCCCTAAATCCTTGGAAAATCCGAATTGCATA 
7GCTAACCTCAC7GCGTCTGACTGCAGACCCGGC7G7AAGCCCCCTGGAA 
CCAGGCCCAAGCCTCCCCGCCATGAAT77TGTTCACACAAG7AAGGCC7C 
GGGG7GAGGTGA7GGGGGTGGCTGAGG7GCGAGGGTGGGGA7GGGGGATG 
GAGCCA77GGGTCCTCTTACAGGGTGAGAGAATTGTAGAA7GGGGACACC 
7AAGGG7GC7GGATGGGGCTGAAGTCTTTCCTTTGTGGAAGCAAATCCCA 
77AGGAGA7AAC7CTGGGAAAGATGAGCCCGGGGAGGGGCAGG7GA7GC7 
CACC7GC7AAGAGGCAAAGGGCAAGGAAGAGTTTGTGCCTGGGAACCTTC 
CAGG7GCCTC7TCTGACCATAGCCAAGAGAC7GGAGACACAGACC7CC7C 
CCAGCAC7GAGGACAAACAGCCATGGGGCCAGTGGGGGTGCAGGGACACC 
CACACCAC7AAGGGC7CAGGGCGGCGCCTTCAGAGCC7GAACC7TCC7C7 
CATGCTGCCATTTGAACACCACAACACCCTAA7AGGAAACTG77AACA7T 
GCCAC7G77CAGGTG7GGAAACCGAGACAGACAG7GGAGA7TCCC7GCCC 
7AGG7GACACAGG7AA7AAGTGACAGA7GTGGAAA7T7AAAGG7AC7A7A 
ACG7C7G7C7GCC7GACTCAGGC77AAGGC7CCCA7CACC7CC7C77C7C 
AGGACAGAG7CAGGAGGCC7CAGCC7GAGCCCCAGC7CTAG7GCAGG77C 
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ATGTGGGAATACTGAGCu^CACTAGTAC^TGGCAGAGAGGACCfiAATGC^■ 

GACCAGGTGTGTAAGGGTGCCTGGCACAGTTGGGGGAGGCTGCTGTCGCT 

TCTCCACCGCTGCTGCTGCAGTTACCTTTGATGT7TTAGTTTTGTTGTAG 

TTACACCATTGCTGGCTTTGGATCTGCACTGTGTCCACTCCAGGTGGAAC 

CACGCACACAAGCCTCTCTGTCGGGCCTGTCCTGACTTCTCCTTGTCAGG 

GCTGGGATCTCCTTCAAATCTGGCGGAAGTGGTTCTCCAAGTCTGGTCCT 

CA^ACGTCAGCAGCATCAGCGCCTAGAAGTGTTAGGAATACACATTCCCA 

GGCCCCACCACAGACCTCCTGCCTCAGAAACTCAGGGCGCTGAGGCTCTA 

GGGGCTGCTTTAACAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTC 

TGAGAGCTACTGCCCTACAGAAAGTTACTAGTGCCCTAAAGCTGGCGCTG 

GCACTGATGTTACTGCTGCTGTTGGAGTACAACTTCCCTATAGAAAACAA 

CTGCCAGCACCTTAAGACCACTCACACCTTCAGAGTGGCCTTGAGAAAGA 

TTTGGGGTCAAGGATCATGAGCGAGAACACCACTTAAGAGGATAGTGAAC 

TAGTCTGCATGTGAGACGCTGAGATCCTATGTCAGGCTGTGATAGGAGGG 

AAACAGAAACCAAAGGAAAGAACAGCTTTAAGAAGCGCTTAAGAGGTACA 

AAGTAAAATGATGGTGCTAGAAAAGTAGCTTCTTAAAAAGAGCATTTTCC 

AGTCTCACCCTGGACTAACTGAATGAGAATCTCAGGAGTGTGAGGCCCAG 

GTATCCATGGTCTTAAAATGCCACCCACCAGGTGATTCCCAGTGTGCACC 

AGGGGTGAGAGTCACAGCCTTAGGCCATGCCACTCAAAGGGTGTCTTCAG 

ACCAGCAGCACCCACAGCTCTGGGAGTGCATCAGAAAGACAGAGGCTTGG 

CACCACCCACACCTACTGAACCATAGTTTGCAGGTGATTTCTTGCACATT 

AAAGTGTGGGAAATGGAAAAGCTTAGAGTTCAGCTAGCTCGGTGACTCTC 

AGTCAACC7GCACCTGCTCCATGAACTCAGACTGCCTGGGATGGGCCCAG 

AAAAGCTCCTGAGGAGATTCTGATGTAAGGCAGGGCTGATAACCATGGAT 

CTCATCTGACCCCATATCACTGGGGAGTTACTTAGGATCTTGCCTGGGGC 

CAGTCATCTCTTCCATAGACACTGAGAGTGTCCACGATGCTTGGGGCACT 

ACAGGGTGGGAGGTGGAGGATCACGGGTGAGTCAGATAGGAAGCCTGCTC 

CTGGGGAGCTTACAGTGCTATAGGGCAGCAAGCCAAGGATGCCAATACCT 

GTGTGCAGGTACCACTGACGAGTGCAGAGCGCTGCAGCACCAGAGAGGAA 

GCTACCCTGTGCAGAGGGGGCTGAGGAGGGCTGCAGGGAGATGACAGGAA 

AGCCGGTGTTACAGGAGGAGTCCTCCCCACTCTTTGGGCATGAGGAGACC 

AGGAGGACATTCTACAGTGAGAAACCCAGGCAGAGGCCATGTGCTTATGG 

CATGGGAAAAGAATGACACCTTAGACTTATTCTCTACATTAGAATTGCCT 

ACCACAGATACCCATATTATAGCTTCACATAGTGTGGTGGTTACTGTGTT 

TTCATATTGTCACATTTGCCATTTTCCAGCCACCCACCCATTCTTGACAG 

TCACTGGCCCAGCCTGGGGGCCCCTGTTCTTTATCAAACAAGTGCCTGAG 

CTCTTTGCAGAGGTGAGGGTCACCTGTCCAATCAGAGGCCAGGAGGGAAC 

GTTCCCTTTTAAGACCCTACTCTAGGCAGGCCTGGCCCAAATGAGTTGCT 

AGGAGCCCACGCCCTAAGAACCCTCTGAGCACTGTTGTGGCTGGTCCTGC 

TGCTAGAAGT7GTTCCTCCAGGGCCAGGTGCAAGATTTGTGGCTTTTCAA 

AGGAGCCACTAAAGCTCCAGCTCAGCCTTGCACGGTGCTGGGCTCCTGGG 

GGCTTCCTGCCTCCAACCCTCCCAACTCTTCCATCACCGCTCCCTTAGCC 

TGGCCAGTGCAGGGATCTGTTCCACTCTAGGCACTGCTGAGGGAATGATG 

CCTCCAGTCAGAGGGTGCAAAAAAGAGAGTTAAGAAAAACAATGATTATA 

AAAAGTCCTTTTTATACGCCAGACATTTTCTTTGCTCAGGCTAAGTGCTA 

CTTATTTGAGTAAGCATTTTAGTTCTCATAACTCCTCTCTCAAGTAGGTG 

CTGCTATTACTTTCATTTCACAGATGAGGACATTGAGGTTTGGAGAGACT 

TAGTAACTTGTCCTCTGTCCTACAGCAGAGCTGGGATTTGAATCTATCTG 

TCCAAATCTGGAACCCATTTGCTTGCACAGAAAGCTTAATTGCTTGTCCC 

AGCAAGATAGAAAGCCTGGGAGTGGAAGAAATATTCAGTGGCTGTGATGT 

CTGAGCCCACAGGCAGGGTGGAGAGCTAGGGCTGGGGCCCTTGGACGTGG 

GGAAGAAAGGGCTGAGTCTTCCATTTTCAATGTGAAGTGTTGATATCTGG 

TGATATTGATCTAGGTCCAAAGGTGAAGAACTTAAACCCGAAGAAATTCA 

GCATTCATGACCAGGATCACAAAGTACTGGTCCTGGACTCTGGGAATCTC 

ATAGCAGTTCCAGATAAAAACTACATACGCCCAGGTGACTCTCAGTTTTG 

GCTGTGTTTTCTGCCTCCACCTAGCAGGGGTAAGGCCTCCTGCTAGGTGG 

GCTCAACTCCATGCTATACCATGCCCCATCTCCAGCAGGTGGTGGAAGCG 

AGGAGGAGAGGCCCCAGGGACTAGGGCATCAGATGAAGGGTCTCTAGCAA 

TGACCAGATCTGAAAGTAGTCTTTCTGGAAGGGCTGGAGAAAAAGAAGGA 

GGCAGACACTTAGACTGGAAGAAGAGGAGGCTTAAACCGGTGTGATGGAG 
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GGAGAAGTGGACCACAGAGTCAAGGGAGAGGGACTGTGCATCAGGCCTGA 

AACCCCAGCAGACAGGAGAGACCTTTCCCTGCTCTCAGAACCCACACATG 

TTCTGACTGTCTTTTTCCAGAGATCTTCTTTGCATTAGCCTCATCCTTGA 

GCTCAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTC T CTAAA 

GGGGAGTTTTGTCTCTACTGTGACAAGGATAAAGGACAAAGTCATCCATC 

CCTTCAGCTGAAGGTGAGAGTTCTAGCTCAGTTTCCTGGGCCTTTGGCTA 

CCCCAAAGTAAAAGGCCAAGATCCTCAATGCCTCTCGCTTTCCTGCAAAT 

TCTTATCTTGGCCAATATAACAGGGACATCCACCTTTCTGGAAGCACCAG 

GCAGAAGAGCCCCATAACTTCTTCTCTGGTTCCTTGCCCCTTCTAGGGAA 

GGAGGAGAGACTCCTCACAGCGGGGAGACAGCAAGGAGCTGAGCACCTGT 

TCTCCTCTCCTGGGCTCACTGGTCCTGGCCCTGGGCGGGTGGCGGTCCCC 

TCCTGCTGTGGCCCTCCATGTGGCAAGCAACACAATTGGGCCAGGACCCT 

GGCGTGCTGCTGTAGGGTAGGAGGGTGTGAGGGAGCACTCGGAGGGCAGT 

GTGTCTGCCCTGCAAATTTAGTCCTGGATGGAGCATCCTTTCACTTGAGG 

GGAGAAATCTTAGGAAGCTGAATTAGATACAGATCTAAGCCATATTCTCT 

AATTTTAAAAACTATAGAGCTGAGATTTTGGTATCCATCTGACTCTTACG 

TCTCTCTCTCTCTCTCTCTCTCTCTCAGTTTATTTTTAATCTGGGGGACA 

AGAAGGCCTGGAAAAGAGGGCATGATTGCTTATCATCCCTTAAATACCAG 

TACCAAGGCTGACACGTCATCTTTCCCAAGGACCATCTGCCTTCTCTCTT 

TTCCTCCTCTCCTGTGTAAAGGCCTGGAGGATGAGCACATGTGCTGTGTT 

TTCCTCCCTCTCAAAGCCTGTGCTATCTAATTAATCCCTTTTACCTCACA 

GAAGGAGAAACTGATGAAGCTGGCTGCCCAAAAGGAATCAGCACGCCGGC 

CCTTCATC7TTTATAGGGCTCAGGTGGGCTCCTGGAACATGCTGGAGTCG 

GCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCC 

TGTTGGGGTGACAGATAAATTTGAGAACAGGAAACACATTGAATTTTCAT 

TTCAACCAGTTTGCAAAGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAG 

GAAACTGCCCCATTGAACGCCTTCCTCGCTAATTTGAACTAATTGTATAA 

AAACACCAAACCTGCTCACTAAACTTTCTGTCATTGGGTTTCATTTCTCA 

TTCATGCTTTAAGGATTTGTGTTTTTAGGATATAGCAAGAAGCTTGTTTA 

ATTACAAAGTTCTGGGTTGGAAAGAGACCGGCTTCTGCTTGTGTACTGCT 

ACCCTGAACCATCAGACATGCATGTGTGTGTCATATGCTATGATGTGGCC 

AGTCTGAGTGCAATACTTGCAGCGGGAAGGAGCAGCTGGGTGCATGCTGT 

GCTCTAGAATTAGTCTTTCCTACTGGGGTTTGGTAGATTCTGAGGGCATT 

GATCCTGGGGCAGAAGTGGCTGAGTCTGTGTCTAGGGTACAGTGTGCAAG 

AAAGAAATGTAACAGCAAGTCACAATCCAGCCAAGTGATAGTGGAAAAGG 

GGTAGTTAGGTCCCAGATAAGGAGCAGGGTGACTTGACCTGTGGGAAAGG 

CACAGAGACAAGGAATCTGGGTCAGATGACAGCCAGGAGACCAGGTGAGG 

GAGGAGCCAGGTACTGTCTGGGAGGCTTGTCAACAAGGGCATGGTCCTAT 

CACTAAGCAGGGCTCAGATCCTCATAATGGGGGAGTGGAAGGCTGGCCGA 

ACAGAAATCAGGGCCTGGAAACAGAGTGAGGGGGTGGAGACAGGAGACTG 

AGGCTTGGAAATTAGTTTATTAGTTTTAGCTCTTCAGTTACAAGCAATAA 

TAATAGCTTCTAGCTTATTTAAGCAACAAGTATACTACAAAAGGAGCTTT 

CTAGAAGGATATTGGGTATATTCATTTCTTACTGCTGCTGTAACAAATTA 

CCACCAACTTAGTGGTTTAAACAATGCAATGTATTATCTTGCAGTTATGG 

AGGTCAGTCTGGAATGTGTCTCACTGGGCCAAAATCAAAGTATCAGCAGG 

ATAGCATTGCTTTGGGAGGCTCTAGGGGAGAGTCAATTTCCTTGCCTTTT 

CCAGCTTCCAGAGGCCACCTGCATTCCTTGGCTAGTGGCCCACTCCCATC 

TTCGCTGCTTGGGTTTTTCTCACACTGCTTTGCTCTGACCCTCCTGCCTT 

CCTCTTTCACATATAAGAACGCTTGCAATTTACATCGGGCTCACGTCAAT 

ATCCAGGATACTCTCCCGTCTCAAAGAGGCTTAACTTTAATCACAGATGC 

AAAGTCCCTTTTGCTATGTCATGTAACATATACACAGGGTCTGGGGATTA 

GAATGTGGACATTTTCGGGGTGCCATTATTCTGCCTATCATGTGAAGTAA 

CTTTCAAAATGGAAAGACATGCTGAAGAAAAAGTCAGGGATTTCTGGCAG 

GCCAGAAATGACAGAAGGCAGAAAACGTTGGTCCCATCACTCAGATGGGT 

AAGAGCCAATCATGCTTTTTGTCAGTTAGCAAAAGATTGAGATTCCAAGC 

AAAGCATGCAACTGCCCTAGTTTGGGTCATGTGTCGACTCCTTGGTCAGT 

GAAGGGCAGCACACCTTGATCAATACTCCCTCCAAGACTGTATCCAACGA 

GGCCAGTGATGTTCCTCAAAGCAGAGCTAGAGAGCTAATCCCAGGAGAGA 

GGCGTG7GGGTGGTGGGCAGGAAGACAAAGCTCAGCCGTAAAGGAGTAGT 

AGGGACAGCACCCTAGGCATGGAGGCTCAAGTGAGATGATACCCATGGGA 



FIG. 4 (55 f 61) 




WO 99/06426 



PCT7US98/16102 



AAAGCTCTGATAAGGTC^oCTCCTTCTGTTTCTGATCCTGATGGTGATGij' 

TGATCAACACCAGCCCAGTGACAAAAAAGTACATAGTATATTTAGTAGAT 

GTTTCCCACACAGAGAAATGGTAAATATTCAAGGCGAGGAATACTCCAAA 

CATCCTACCTTGATCATTACACATTCCGTGCATGTAATGAGTACTTGCAT 

GTATGCCATAAATATGTGAAATATTATGTATCACTATATAAAAGAAAAAA 

AAATGTGGCCAGGTGACATCCATATTTTGGAGAGGAAGGCATGTCTTCTT 

CATAATATCACAAAACTATTTTCACAACAAAGACACAGCTGTTCAAATTA 

GTCTCTGAGCCGGGGCTGTCTCATGGCAGTGAGGACTCTGGTTCCCTTAC 

AGACTAGCAGAAAGGAGATGGGGCTTACTGACCATGGCCTTGAGGAGGCT 

GAACATGCAGGCCAAATGGAGACACAGACAGCCTGGGCTTGGTCCTGCTC 

CATCCCCTTCCAACCTGATGAGATATAGTGAGTCACTATGACGTGGGTCA 

CTCATGCTTCCTGTGAGGCTCCACCAAGACAGCAAGTGCATCAACACCTT 

ACGGAAGCACAAGGCCCTGTTTGTTGTTGACTTCATGAAAGGCATGGTTG 

TGGTGATCGCATTGAGTAGGCTTTTGGGTGAGAGGTGAAAAACCCCAACT 

ATCATGCATTGCAGCCCTCTGGTGGAAACTGTGCTTCAGGCTCTAAATTT 

CAGGCTCTAGACTGACTCCAGGATGAGTATTTGGAAGCTGAAGTCAATCT 

GTGGTCTCTTCTCCTGTAGAGCAGGAGTCAGCACTTTTCATAGAGTGCCA 

GATTCTATATATCCTGCCACATGCTCTGTTGTTACAGAACAAAGAAGGCC 

ATAGACAGCATGGCTGTGTTGGCAAATACACAAAACAGGCAATAAGCTGT 

ATTTGGCCTTTAGGCTGCAGTTTGCCAACCCCTGCACTAACACAGAGCTT 

AAAGGTGGTGGTGGTGTGCTGGAGCTAGCTTATATCAGCTTGCAATAGCC 

AATTGCTAACATCTCTTCCAAACTCTGTGTCTGTGCCTTGATGTTGATAG 

TTTGAAATTGGCTACCCCATTTAATGCTGCAATCT7TTCTCACCCCAGCA 

CTACTGACTCCCCTTTGCCCTGTCTTATTTTTCTCACTCTAACATGCTGT 

ATAGTTTTCTTCTTACATTTATTGTTTGTGTCTTCCACTAGCATGTATGT 

CCCACAAGTTCTTTGCTCTGTGATGTATCCCAAGAACCCACTGCAGTGCT 

TGGCACTTGTAGGAACTCCATAAGATTTTTATAAATGAAGAAAGGAAGAA 

AAAAGAGAGGGAGGGAAAAAGGAAAGGAAGCCTTCTATTTAAATGATGGC 

CTTCTCCATATTTCTATAGTAATATGACTTCCCTTGCAAAGGGGGATGCA 

TTTTGGAAAATGTGTATAAATAAACTCAGGTGGTTTTGAATTTCATTTTC 

CTAACTGTAATTGTAATCATTGGTCTTTATGTTTAGTGAAAAAGTTTTGG 

CCCTTATGCCTCACACCTGAGAATCCCAAAGTATTGGTTTGTTAGAGCTC 

CCATAGAGAACCATAAACTGGGTGGCTTAAAACAACAGAAATGTATCGTC 

TCCTGGTTCAGGAGGCCAAAGTCTGAACTCCAGGTGTTGGTTCATTCTGA 

GAGCTCTGAGAGAGAATCTGTTCCAGGCTTCCCTTCAGTTTGTGGTAGCT 

CCAGGGTTCCTTGGCTGGTGGCAGCAAAACTCCAGTCTCTGCCCCCATCT 

TCACATGACTGTCTTCTCTCTGTGTTTCTGTGTCCAGATTGTCCTATAAG 

GACAGAGTCATACTGAATTAGGGCTCACTCGAATGACTTCATCTTAAGTT 

GAACTGTATCTGTAAAGACCTTATTTCCAAGTAAGGTCACATTCACAGCT 

ACTGGGGGATAGGACCTCAACATATCTTTTTGGGGGACATAATTCAACTC 

ATAATACCCAACATGATAACTGTTCATCCCATGAAATTTAATGTCTCTCA 

AAAGGTGATCTCAGGGCATTTAATCTGTGACAGAAACTCCCATAGGAAAC 

ATTCCAACCAGAAGCTCCTTTCACAGCTGGTCACTCCTCCTACCCCATCC 

GAGGTCCTGGGGCAGGGTGAGGCAGGTGGGGACAAGAAGAAGGCTGTCTC 

GGGTGTAGAAAGAGAAGACCCTTATTCACCCGGCACTCTGTTCATGAATG 

AGCTATCCAGCATAGGATATAATAAATCGCTTTAGGAGTGGTAGACTCCA 

AACATTTTTTTGGTCCCAGTTATCCTAATCAATTAAACAAACTCTAGAAC 

CCATCTTGAAGTGCAGGCATTGGGACATTATGAAACTTACACAGAATTCA 

AAAATTTACAAGGGCTAAATAAAACAGGGTCTGACATCTAATATTTTCTT 

CCCACATTCCCATGCACTGTCTGGCTCAACCATCCCCAACCCTCACTCTC 

ATCCTGGTGGACACATGCCTAGTGATGTGATCAGCTGGTTCACAGGGGGC 

TGGTGATGGTGGATATACAGCTTTTGCCAATTTCCATGGCATAACTACTC 

CAAATATGGCCAATTTCAAACTACCAACATGAAGGCACAGACACAGAGTT 

TGGAAGAGATGTTAGCAATTGGCTATTGCAAGCTGATATAAGCTAGCTCC 

AGCACAGCACCACCGCTACCTTTAAGCTCCTTGTGTTAGTGCAAGGGTTG 

GCAAACTGCAGCCTAAAGGCCAAATACAGCTTACTGCCTGTTTTGTGTAT 

TTGCCAACACAGCCATGCTGTCTATGGCCTTCTTTGTTCTGTAACAACAG 

AGCATGTGGCAGGATATATAGAATCTGGCAGTCTTTAATAAGTGCTGACT 

CCTGCTCTACAGGAGAACACAGATTGTCTTCAGCTTCCAAACATTCATCT 

CTGAG7CAGTCTAGAGCCTGAAATTTAGACTGAAGCACAGTTTCCACCAG 
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AGGGCTGCAATGCATGA1AGTTGGGGTTTTCACCTCTCACCCAAAAGCCT 
ACTCAATTTTTTACTGCAAAAACATGTTATCATCATTATTTTTTACTTAG 
CCCACCTTTCCTTGGCAATTTTCCATAGGAAAATGCATTCTAAATTTCAA 
CTAATCAGGGGACTTGGAGCCTCTGGACACCCCCTTGTTCCTTGCCCACA 
GTCCCTTGCAGAAGGTGCCTTATCAGAGCGGCTCCATGCAGGGGCTCAGG 
ACAGGATCAGATGTCAGTTGCACCAAGGGGGCAGGGACAGATCCTCTCTG 
CTSACCATGCAGAAGGGACTGTTCAGTGCACCGTCATGGTCCTGGTGATT 
TCTGGTCCATAAGGGAATTTTCACATGCATCGGGTGATTGTCACATCAGC 
ACAACACTGTGAGGAAGGCAGAGTGAGAATTTGTGTGCCCATTTTATAGG 
TGAGAAAACAGATGCAGAGACATTAAGTAACTTCACCACAGTCATGCGGG 
TTTTAAGTGGCAGACTTTCAGGTGTTGTGACTCCTAGTCCAGAGTTCTTT 
GCACTGCCCCTGAGGTGCTAAAACTCTACTGTGCTTTAAGACTCACTTGG 
GGAGCTTCCTAAAAAGAGAGATTGCACAACCTGAGATTCTTGTTTAACTG 
TTTTGGGATGTAGCTCAGGGATCTAGCTGCCTTAAAAAAAAAAACTCCCA 
AGTAATTCTGATGCAAGCGGTTCTTTTTTGTCCACCTTTGAAGAAACACT 
GCCTCCTCCCCATACATTTCATTAGAAAATGGTAACATGTTTTTCAGCCT 
GAGAGCCATTTCTGGGTGACCGGACGTCGGCAGCCCGCTGTACTAGCTTT 
CAGTCTAGGCTTAAACACACATGATAGGAGATGTCCTACTCCAGATGATA 
TGAGTCTGAACCATGGAAAAATTCCATTGTGTGGCACATCTGGTGGGTGT 
GCACTGTCCCCAGCAGTGAGGCACCCAGTGAAGACAGCAGCTGGGAGAGG 
CTTAGTTACATGCAGTGGGACAGTGTGGGCTAGACTGCTGAGCCCtCTGC 
AGTTTACTCTGTGTCAGGCAATGAGGGTGAAAGGCTGATCAGACCCACGT 
GCAGACCATACCCTCCAGGGAGACAGATATCAGTCAGGACAACCCCAAGT 
GTAGCTGGAGAAGCAGTGCCCAGGTATGACCGGATGTGTATCCAACCAGG 
AAATCTGCATATAAATATAAGAGGAGAAAATGAACAGATGTTGCTCTTAT 
ATGTAGATATTTATGAAGAGCATATAATTTTGTTTTGTGTGTTTTAAGAA 
GTTTATAAGTATGCCTTAAAAATGTATAGTATATACTGTAGGTATTTTTT 
CCATTAGATATTTTGTTTTTCATACTTATCCACATTGACATTGTAGCAAC 
AGTATAATATAACAACCTCCTCTACAAAAGCAGAAGGAAGTGAAGCTTTG 
GAAGGAAGCACCCAGTGAGCTTGCCCCTTTCAGGTGGGTGCAGTGAGCAG 
GAGTCAGTGAGGTTGAGATCCTTTGAGAGGAGGCAATCATTAACCAGGAA 
ATCTGCACTGCATCCTGGCCACACCTAACCCTTGGACAATGGTGCTTGGA 
GCGCCTTCCAGCTCTTAAGGCTTGCGATTTCTTTCTCTCACTCTTCACCC 
ACGATGATTAAATCTTCTCCTACAGAGTTGGACAATAAAGCCTTGAGTTC 
CTGCCTCCCCTGGTGTGATCACGAGGCATAGACATGGCCAGGAACATGTA 
GGTGTCTTTGAAAGCTGAACAAGTTAGTAAATTTCAAACCTCATTTCACC 
CACCAGTAAAATGGGAATAATAATAAACCTATTTTACATAGGGTTGACAA 
GAGGAGTAAAGAGGGATTCAATGAAAGTTCGTTATTATCATTTGTAGTAG 
CAGTGTTGATAATATCAACTGAAAGTTCATTATCATTATTAGTAGCAGTA 
TTGATAACCCTCTTTTCTGTGCCTTCTCACTGGTGGGCCCAGGCCATCAG 
CAATGCCCAGGGTGTCATGGATCTCTGCTGCATCGGGCACCAGCTGTGTC 
AATGGTGAGAACAGTACAAGGGTGGGCAGGGCAAGGCAGGAAGCACCCAG 
GAGCAGCAGCTTCATGGGGTGAAGATGTCAGGAGCTTAGGGACAGTCAGA 
GCGGGTGTGCCTCCTCTTGTGGAGCCTTTCTGCGTGGGTAGGAACTGCTG 
CAGCTGTGGCCATGGATTCACCTGAATATGGGTGGAATTAGGCATTCAGC 
TGGGTTAGCTGTGCCTAGAAGGAGGAACTCTAAACTGAGAACTTGTCCCT 
ATTGCCACCTCTGATAGGCAGATGATCCATCCATCAGTGGCTGAGCTGAG 
GTGTGCATGGGGATGGGTAAGAGCCCACACACAGGGCTGATGACTGAGTC 
TATTTAGAACAATAGATGTAAAATCTGATAATGTAAAATGTGATAGATTA 
TTTTGTCAATTAGAAATGGTACCATATAATTATATATATACATAAACATG 
TATACATATACACACATATACATGTGTGTATAAACACACACAGTATTGTC 
CCCTACTCATTCCATAAACCTGATGCCTTTAGCTGGGATTCCCAGCTTTC 
ACTCTCCTCTCTGTCATCTGCTGTCTATATCCTCCCCATCCTGTAATTCT 
GGCTTATATGCCACTTCCTCCCTAAAGCCCTCCCTCAATCCCTTGCTGGA 
AGTGACATTTTCCTCTTTGAGCTGCCCCTGCTTGTGCTTTGGTGAGGTCA 
GCTGTATTGCAGTACCTTGTATTGTGGTTGTCACATCATCGTATAGAATT 
AATTTCTGACACATTCCGTATTTTTCAAAGGGCCTAGTGTGGGGCTTTAA 
CAGTAACTACGCCACCACGCCCAGTTAATTTTTTGTATTTTTGGTGGAGA 
CAAGGTTTCACCATGTTGGCCGGGCTGGTTTCGAACTCCTGACTTCAGGT 
GATCTGTC7GCCTCAGCCTCCTGGAGTGCTAGGATTGCAGGCATGAGCCA 
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CTGCACCCAGCCA.CCTATCAAAATTTTAAGTGCCATTTTTATTTT^ATT 
TTTTGTAGAAATGGACAAGCTGATCGCAAAATTCACATGGAATTGCAGGA 
GGTTCCAAATAGCCAAAACAATCTTGAAAAAGAAGAACAAAGTTGGAGGA 
TTTACACTTTCCAGTTTCAAGACTTAGCTCTTAGCTACAAAGCTACAGTA 
ATCAGAACACTATGGTCCTGGCATAAGTGATGCTGGACAGGTGAGCCCCA 
AAGTGGGACTTAACCTGTGAAGGTTCTTGGCCTTGCCCAGGAAGGAATTC 
AAC^GCAAGCC^TGGGAC^AGAAAACAGCTTTATTGAAGGGGCAGTATT 
ACAGCTCCAGCCCTGTTACAGCTCCAGCCCTGTTACAACTCTGACTACTC 
CTGCACAGAAGGGCTACCCTGTAGGCAGAGAGTAGCAACTCAGGGCAGTT 
TTGCAGTCATTTATATCCACTTTTAACACATGCAGATTAAGGGACAATTT 
ATGCAGAAATTTCTACGGAATTGGTAATAACTTTTGGGTCATGGAGTCAT 
CATGGAAGGGGGGCGGGGAACTCCCTGGTGTTGCCATGATGACGGTAAAC 
TGATATGGCGAACTGGTGGGTATGTCACATGAAAAGCTCCTTCCACCCCA 
GCCCTGTTTCAATTAGTCCTCGGTTTGGTCCAGTGTCCAAGTCCTGCCTC 
CAGAGTCAAGTCCCACCCCCTACCTCTTAAGGAGAGATGTAAATACATGG 
AATAGAATTGAGAGTCCAGAAATAATCTCATACATCTATGATCAATTGAT 
TTTCAGCAAAGGTGCCAAGACCATTCAATGAGGGAAAGAATCATATTTTT 
TTCAACAAATGGTGCTGGATAACCACATGTGAAAGAATGCAACTGGGCCC 
TTATCTCACACCATATACAGAAATTAACTCAAAATGGCTCAAACACTTAC 
ATGTAAGAGCTAAAACTATAATATTCTTAGAAGAAAACAGGGATATATCT 
TTATGACCTTGGATTTGCTGGCTGATTCTTAAATGACACTGAAAGCACAA 
GCAACAAAAGAAAAAAAAATAGGTAAATTGGACCTCATCAAAATTTAAAA 
CTTTTATGCTGGGTGCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGG 
CAGGAGGATCTCTTGAGCCCAAGAAGCTGAGGCTACAGTGAGCCGAAATT 
GTGCCACTGCACTCCAGCCTGGGTGACAGAGCAAGACCCTGTCTCGAATA 
AATAAATAAACAAATATATAATTATAGATCTCTGGATCTTGCCTTCGGAG 
ACTGACTCAACTAACTGGTCTGGGTGGGAGCCCAGCCATTTGTATTTTTT 
GAAAACTCTCCAAATGATTTTACTGTGCAGCCAAGGTTGAGAATCACTGT 
ATCATAGGGTTGGACTCCTAACTGGAAACAGTTTGCACCATCAGGTGTCG 
CAGCATTCTGATAATAG TTAA GCTTTCCTCCTAGATTTTCTGATATTAGA 
TGAGT CATGTTTACAAGTTTTTAC CAAGAGA CAAACTATCTTTCT G C C CT 
TACTTTCTCTCTTATACTATTCTAATCCCAGAACCCTTTGGAACTTCCAC 
TGAGAGATGAATCTAGAAAGTGACTCTCTTGGCTACAACAGAGAGTAATG 
TTGGCCTGTTTGTGCCAGATCCAGTTGGTGCTGGTGGTGGGACAGCACCT 
CCCTGAAATCCCCTCCTCTCCCGTCAGATTCAGTCCCCCATTTGCATCAC 
GTACAATCATCACTATGGGTTTCTATTACCTTGCTAGGGCATTTGGAGGT 
ACCATATATACCAACTATTAGTTTTGAGCCATGGTTCCCAAAGTGTGGAC 
TGTAGGGCACCTCAGCACACTCACGAGGTGTCATGGGATATTTAAATATT 
CTGAAGAAAACACAGTGACATCTGTCAGGCCCGTGAAAACCGTTGGCATT 
AAATTGTCTCAACCCAATTGCTTAAGAAGCAGAACTGGCCAGGCACGGTG 
GCTCACATCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCAGATCAC 
GAGGTCAGGAGTTCGAGACCAGCCTGACCAACATAGTGAAACCCCGTCTC 
TACTAAAAATATAAAAATTAGCCATGCATGGTGGCATGCACCTGTAACCC 
CAGCTACTCAGGAGGCTGAGGCAGGAGAATTGCTTGAACCTGGGAAGCGG 
AGGTTGTAGTGAGCCAAAATCGTGCCACTGCACTCCAGCTTGGGTGATAG 
TGAGACTACATCTCAAAAAAAAAAAAATGAGAGAGAGAGAGAGAAGCAGA 
ACCATCAGGTGTTTCTTTTGGCTTAAAGTACTCTGTGAAGAAATTCCTGG 
GACACGAAGGATACCATGAACTGAGAGATTTTGGGAACCTCTGCTTTAGA 
AGCTGGAGGTAGCATTCCTTGGGCACAGTACTGCG!TTGGGATCAGCAAAT 
CCTTTTGATGGTGCATTTAGGTGTGGCAAGACAGCTCTTAGAGTGGGACC 
GGGATGTGCTTGGAGACAGAGGGAACTAGATTGAGCTGCCCGATAAAGAC 
ATGCCAGCCTGGCAGAGTGTAGTGACTCATGTCTGTAATCCTAGTGCTTT 
GGGAGGCTGAAGTGGGAGGATTGCTTGAGGCCAGGGGTTTGAGATCAGCC 
TGGGAAACAACAAGAC CT CTACAAAAAAAAAAGAAAAAAAAAATTAAC CA 
CATGTGGTGGCATGCACCTGTAGTCCCAGCTACCTGGCAGGCTGAGGTAG 
GAGGATCACTTGAGCCCAGGAAGGTAAGGATACATTGAGCCATGACTGTG 
CCACTGCACTCTAGCCTGGGTGACAGAAAGAGACTCTGTCTCAGAAATAA 
ATTAAATAAATAAATAAATATATAGTGGCCATGACATCCCTAGAAAGACA 
AGGTCCTGGGAATAGGTAGAAGCCAAGGGAAATGAGAAATGAGAGGGGGC 
CCTGGAGCTGGAACTGGGGGAGCAGGATGGCCTCTGAGAAGTTCCTGATA 
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GTGGTGTCACTGATGTGTCTGATGTTTAGTTGTAATTATTTGCTGGGCCC 
CTGTCATCCCTCATATCTGATAGCTCTTTGCTAGTCAAAGTGTGGTCTGG 
GGATCAGCGGCATCAGCATCACTTGAGAACTTGTTAGAGATGCAGAATCT 
AGAGCCCCACCCGGGACCCAGAAACAGAGCCTGCATTTTAACAAGCTCCC 
CAGGTGATTCTCACACACACTCGCATTTGAGAAGCACTGGGCTAGTTGAC 
AGATTCTCAGGCATGGCTGACATTGAAATATCCAGGGAGCAGGCTTGGCA 
TTAGGATGTTTAAAAGTCCTCCAGGTGTTTCTAAAGCCAGGTTTGAGGAA 
TTACTGGGCTGATACAAATGTTTTGTGATGATGCTTTGTGTGTGTGTGTG 
TGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTAGGGAATTC 
TGGGTCACTTGGCACCAACACAGGAAACAATGGAAATATGTGAGCCATGA 
CAGAAAGGTCAGGAGATAAAAGAAATTAGTGACATGAGAGGTACTCCTCA 
GGTGTTAGGAAAGAGGGTAGAGCAAACCAGGTTTTCCACCATATGTTGGA 
TAGGGGGTCAAGTAAATTTCTACTTAAAAATTACAAACAGGGGCTGGGCG 
CGGTGGCTCATGCCTGTAATCCCGCACTTTGGGAGGCTGAGGAGGGCGGA 
TCACAAGGTCAAGAGATTGAGACCATCCTGGCCAACACGGTGAAACCGTG 
TCTCCACTAAAAATACAAAAATTAGCTGGGCATGGTGGTGCGTGCCTTTA 
TTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCTGGGAG 
GTGGAGGTTGCAGTGGGCCGAGATCGCACCACTGCAATCCAGAGCGAGAC 
TGTGTCAAAAAAAAAAAAAAAAAGAAAATTCCAAACAGGATGACCCTAAG 
CCTGCAGGACTTGGAGACATCTAGGTGACTGATACTCAGTCACAAAACAT 
AATTGGTCACAGGCCTGATGAAATGCACAGCAGACCTTCAGATGGTATGC 
ACTCAAGTGATATCCACAAGTCCACCTAAAGAAATGCTATATTCAGACAT 
TTGGCATCAATCTCTATCAAACAAAGATAGTCCAAAGCAATGGGTTCCAA 
AAACACTTTCCTAAGACAAATTCTCTATTTGCTTTTAATATCAGTCATCC 
CAGCCCTTGGAATAGAGGAGCAAATGATACCAGTGGTACCCTACCACAAT 
GCACCAAGGTATTATACTCTCATGCTCCATTTTCTCCCTCTGTCTACATC 
ACTAATAACTCATTGATTTCTGGTGCAAGCCCTGCTGGGAGAAAAAGTCT 
ACTCTTGTACCTTGGAGCAAGTTGCTCAGAGTAGGTATCGAGGATAAAAT 
TTGGAAAGTTAGAAAAGCTATTAGAAGGAGATCCTAGTAGTTGAAAACAC 
AGCCTGGCCAAGTCAATGATGCTATTTCATCTCCCCAGCCTTGCATGTCC 
ATAGCTAAGGAAGACAATTTAGGCTTGGGCTAGAGGATGGGAAAGGGCAA 
AATTACTGATGCCACAGCCCAGAGAGGTATTCTAGTAATCTGAGGGTGAG 
GACCACATACCTGGTTCAGGGACGTACAGTGTTGACAGCTGTGAGTGGAT 
GCCTGGAGTTCTGGCGTGTCTTCTAGCACAATGATACCTGAGACTCTTGC 
ATCATTGGGAATAATAAAATGGGAGTGGATAGATATGAAATTATGATGGC 
AATAAGCAATCAGCTAATAGCTTCATTGATGGGACAGATTAAAGATGGCT 
GCAAATCCTTTGGTCCAGGTTTGGGATATAGGCAGCATTTGTATTGGAAT 
GCTGATAG7CTGAGGCCATGAAAAGTCCACCTGCAGTAGTGGTAGGAGGA 
ACAAGCCTCACTTTCTTCAATGTGTGTGACTGCTGTCTTGATTCCCTGGG 
TGGCCAGTTCCATTCGTGTGGTTCTTTGGTCCACTTGACTCTGGGGTGGC 
TCTGTGATGGCTTGACCAATACAATGTAGTGGAAATGATGCTGTCATCAT 
TTCCAGCCTCTTCCAGCCTTAAGGAACTGGCAACTTTTATTTCTGTCCCT 
TGGAATACTTGTTCTTGCAACCCATCCATCATACAGTGAGAAATTCTAAG 
CTGCCCCATTAAGAGGCCCACATGGTGATAAATTGGGGTCTTACATACAG 
CCCTAGCTGTGCTCCTAGCTGACAAACAGTAGCAACTTGTCACCAGGCGA 
GTGAACCACTTAGGACTGTATACTCCAGCCCCAGTTGAGCAATGTGGAAC 
AGAGTAAACCATCTCAGCTTAGCCCTGCCCAAACTGCAGAATTATGAGCA 
AAATAATCCCCTAGGCTTTGGGCTGATTTGTTCCAGATTACTGGAACAGA 
ATTTGGTACCAGGGGTGAGGTGCTACAGCAATGAAAGCTTAAGACACGTG 
ACTTTGGTTTTGGGTCTGAGTGGCAGGGGAACTTGGCAGGCCTCAAGGAA 
ACTTTTAGGGAGGGTTGAAGCATAGTGAGGAAAACAGTAGGGGAAGCTAG 
AGGAAAAAATGATGCTTGGTATGTAGTGGTGGGAAGTTTAGCAAAACTCG 
CCTGATGTAATGTGGGAAATTGTAAGAACTCAGAACGATTTAAGGGCATG 
TTTTATAGGTCCTTTAAGAAACTTCTAGGCCAGGCGCAGTGGCTCATGTC 
TGTAATCCCAGCACTTTGGGAGGCTGAGGTGGGCGGATCACAAGGTCAGG 
AGATCGAGACAATCCTGGCTAACATTGTGAAACCCCGTCTCTACTAAAAC 
TACAAAAAAAAATTAGCCGGGCATGGTGGCGGGTGCCTGTAGTCCCAGCT 
ACTAGGGAGGCTGAGGCAGAAGAATGGCGTGAACCTGGGATGTGGATCTT 
GAAGTGAGCCCAGATTGTGCCACTGCACTCCAGCCTGGGCAACAGAGTGA 
GACTCCGTCTCAAACCGAAAAAAAAAAAAAAAAAAAGAAACTTCTAGGGC 
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TGGTCCCGTGGAAGCCTCACACATGGTACACAAAGGCTGTCTTGAAAAGA 
AACGTAAGTGTGTTTTTTGGTTTAATAAAATTGATTATAAATGGATAATG 
CAAAACATTTTAAAGAATTTTACTAGCTTACATTAGCAGATTTGGATCCA 
GTGATTGTTACATTCTGGTACTGAGCCCCTGAATTACTTCTTTGAGTAAG 
GCATTATACCAAAGCTATTGATAGTTGGGCTTATAGGGTGTATGTTTGAA 

gaactactaatgtcaaaaccaatatttcacggtcgacaagaggacatcag 
aw:tggtaatccttattaccatgactggctggacagaatactcaatgtaa 

T&GATTTCCTGCAAATAAAGACGGGGAAGATGTAAAAAAGATGCCTGAA 

CATTCAACATTAATGAAAGATTTCAGAAGAAATATGTATACTAACTGCAG 

CCTTATCAAGTATATGGAAAAACACAAAGTTAAACCAGATAGTAAAGCAT 

TCCACTTGCTTCAGAAGTTTCTTACTATGGACCCAATAAAGTGAATTACC 

TGAGAACGGGGTCCCTGTTTCTTCGAAGACCCACTTCCTACATCAGACGT 

TTTCAACAGTTGTCAAATCCCCTACCCAAAATGAGAATTTTTAACAGAAG 

AAGAACCTGATGACAAAGGAGCCAAAAAGAACCACCACCGGCAGCAGGGC 

CATAACCACACGAATGGAACTGGCCACCCAGGAATCAAGACAACGGTCAC 

ACACAGGGACCCCCGTTGAAGAAAGTGAGGCTTGTTCCTCCTACCACTAC 

CTCAGGTGGACTTTTCACGGCCTCAGACTATCCGCGTTCCAATCCACATG 

CTGCCTATATCCCAACCCTGGACCAAGCACATCCCAGCCGAAGAGCAGTG 

TAGGATACTCAGCTACCTCCCAGCAGGCTCCACAGGACCCACGTCAGACA 

CACGGGTACTGAGCTGCATCGGAATCTTGTCCGTGCACTGTTGTGAATGC 

TGCAGGGCTGACTGTGCAGCTCTCCGTGGGAACCTGGTATGGGCCATGAG 

AATGTACTGTACAACCACACCTGCCCAGTAGCCAAGTTCCTTCCACCGCT 

TTTCACAGATCGGGGTAGTGGCTTCCAGTTTGTACCTATTTTGGAGTTAG 

ACCTGAAAAGAAAGCGCTAGCACAGTTTGTGTTGTGGATTTGCTACTTTC 

ATAGTTAACTTGACCTGGCTCAGACTGACCAGTACTTTTTTTTCCGTGAC 

AGTCTATAGCAGTTGAAGCTGAGAATGTGCTAGGGGCAAGCGTTTGTCTT 

CATATGTCATGAATTCCTCCAGTGTAACAACATTATCTGACCAATAGTAC 

ACACACAGACACAAGGTTTAACTGGTACTTGAAAACATACAGTAGGTGTT 

AACTCAGTGAAATAACCAGGACTCAAAGTAAGATTATTTTGGTACACCTT 

TCTTGTTAGTGTCTTATCAGTGAGTTGATTCATTTTCTACATTAATCAGT 

GTTTTCTGACCAAGAATATTGCTTGGATTTTTCTGAAAGTACAAAAAGCC 

ACATAGTTTTTTTCAGAAAGGTTTCAAAACTCCTAAAGATTAATTTCCAA 

GTATAAGTTTGTTTTTATTTTCAATCTATGACTTGACTGGTATTAAAGCT 

GCTATTTGATAGTAATTAGATATATTCTCATTGATATAAACCTGTTTGGT 

TCAGCAAACAAACTAAAATGATTGTCACAGACAATGCTTTATTTTTCCTG 

TTGGTGTTGCTTGTGGGAAAAAGAAAGAGAGATCAGATTGTTACTGTGTC 

TGTGTAGAAAGAAGTAGACATAGGAGACTCCATTTTGTTCTGTACTAAGA 

AAAATTCTTCTGCCTTGAGATGCTGTTAATCTATATAACCTTACCCCCAA 

CCCTGTGCTCTCTGAAACATGTGCTGTGTCCACTCAGGGTTAAATGGATT 

AAGGGCGGTGCAAGATGTGCTTTGTTAAACAGATGCTTGAAGGCAGCATG 

CTCGTAAGAGTCATCACCACTCCCTAATCTCAAGTACCCAGGGACACAAA 

CACTGCTGAAGGCCGCAGGGACCTCTGCCTAGGAAAGCCAGGTATTGTCC 

AAGGTTTCTCCCCATGTGATAGTCTGAAATATGGCCTCGTGGGAGGGGAA 

AGACCTGACCGTCCCCCAGCCCGACACCCGTAAAGGGTCTGTGCTGAGGA 

GGATTAGTATACGAGGAAGGAACGCCTCTTTGCAGTTGAGACAAGAGGAA 

GGCATCTGTCTTCTGCCCGTCCCTGGGCAATGGAATGTCTCGGTATAAAA 

CCCGATTTTATGTTCCATCTACTGAGATAGGGGAAAACCACCTTAGGGCT 

GGAGGTGGGACATGCGGCAGCAATACTGCTCTTTAAGACATTGAGATGTT 

TATGTGTATGCATATCTAAAGCACAGCACTTAATTCTTTACCTTGTCTAT 

GTTGCAGAGACCTTTGTTCACGTGTTTATCTGCTGACCTTCTCTCCACTA 

TTATCCTATGACCCTGCCACATCCCCCTCTCCGAGAAACACCCAAGAATG 

ATCAATAAATACTAAGGGAACTCAGAGGCCGGCGGGATCCTCCATATACT 

GAACGCTTGTCCCCTGGGCCCCCTTATTTCTTTCTCTATACTTGGTCTCT 

GTGTCTTTTTCTTTTCCAAGTCTCTCGTTCCACCTAATGAGAAACACCCA 

CAGGTGTAAAGGGGCAACCCACCCCTTCATTGCTGATTTGTGAGCGTGCT 

TTAAGGTGAAAAAAGCATGAATGTTAACTTCCTTAAAAAGGTACAGCATC 

CAATTCAAATATTTTTGTCCTGATTTTAATGCTAGTTGATGTAGTGCTAT 

T AAAATTTTGTTCAACATGGACACAGAGAGGGGAACAACACATAC CAGGG 

CCTGTTGCGGGGTGGGGATGAGGGGAGGGAACTTAGAGGACAGGTGAACA 

GGTGCAGCAGATCACCATGGCCCACATATACCTATTTAACAAACCTGCAC 
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GTTCTGCAC^CGTATCCCATTTCTTTTTTTTTTTAAGAAATAGAAAAAAA 
AATAAAAT7TTGTTCACTGATTCTTCCATTTTAAAACTTGTTTGCATGTG 
GTTTAGGATGCCCTTACTTCAGCAAAGGAGAAGGAATAGGAGGGCCTTAG 
AATTTTTGAGGGAAAAAAACCCTATAACATACATTGTACTGTATCAAACT 
ATTTTACATGAATGACACAAGTATTCTGAATAAAAAAATAATTGAACATT 
GTTAAGAACAAGGTGTCATGTAATTTATTTTTCATAAATAAAAAAATTAT 
AGrGGCTTAGACTGAAAGGAACAGAGAATTTAAAAAATTAAAAAGAAGCC 
TTAGTATATTTTTGTATATAGTTTCCATGTGCCATATTTGCCATAATTGG 
ATGAGAATTTTTTGACCTCTGGCAGGGTGACCCTATATTTTCANTNTATA 
AAGCGTGCATCATACC 
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